Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
imvotenet_faster-rcnn-r50_fpn_4xb2_sunrgbd-3d.py		imvotenet_faster-rcnn-r50_fpn_4xb2_sunrgbd-3d.py
imvotenet_stage2_8xb16_sunrgbd-3d.py		imvotenet_stage2_8xb16_sunrgbd-3d.py
metafile.yml		metafile.yml

README.md

ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes

ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes

Abstract

3D object detection has seen quick progress thanks to advances in deep learning on point clouds. A few recent works have even shown state-of-the-art performance with just point clouds input (e.g. VOTENET). However, point cloud data have inherent limitations. They are sparse, lack color information and often suffer from sensor noise. Images, on the other hand, have high resolution and rich texture. Thus they can complement the 3D geometry provided by point clouds. Yet how to effectively use image information to assist point cloud based detection is still an open question. In this work, we build on top of VOTENET and propose a 3D detection architecture called IMVOTENET specialized for RGB-D scenes. IMVOTENET is based on fusing 2D votes in images and 3D votes in point clouds. Compared to prior work on multi-modal detection, we explicitly extract both geometric and semantic features from the 2D images. We leverage camera parameters to lift these features to 3D. To improve the synergy of 2D-3D feature fusion, we also propose a multi-tower training scheme. We validate our model on the challenging SUN RGB-D dataset, advancing state-of-the-art results by 5.7 mAP. We also provide rich ablation studies to analyze the contribution of each design choice.

Introduction

We implement ImVoteNet and provide the result and checkpoints on SUNRGBD.

Results and models

SUNRGBD-2D (Stage 1, image branch pre-train)

Backbone	Lr schd	Mem (GB)	Inf time (fps)	[email protected]	[email protected]	Download
PointNet++		2.1			62.70	model \| log

SUNRGBD-3D (Stage 2)

Backbone	Lr schd	Mem (GB)	Inf time (fps)	[email protected]	[email protected]	Download
PointNet++	3x	9.4		64.48		model \| log

Citation

@inproceedings{qi2020imvotenet,
  title={Imvotenet: Boosting 3D object detection in point clouds with image votes},
  author={Qi, Charles R and Chen, Xinlei and Litany, Or and Guibas, Leonidas J},
  booktitle={Proceedings of the IEEE/CVF conference on computer vision and pattern recognition},
  pages={4404--4413},
  year={2020}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

imvotenet

imvotenet

README.md

ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes

Abstract

Introduction

Results and models

SUNRGBD-2D (Stage 1, image branch pre-train)

SUNRGBD-3D (Stage 2)

Citation

Files

imvotenet

Directory actions

More options

Directory actions

More options

Latest commit

History

imvotenet

Folders and files

parent directory

README.md

ImVoteNet: Boosting 3D Object Detection in Point Clouds with Image Votes

Abstract

Introduction

Results and models

SUNRGBD-2D (Stage 1, image branch pre-train)

SUNRGBD-3D (Stage 2)

Citation