Home

Awesome

3D Object Detection with Pointformer

This repository contains the code for the paper 3D Object Detection with Pointformer (CVPR 2021) [arXiv]. This work is developed on the top of MMDetection3D toolbox and includes the models and results on SUN RGB-D and ScanNet datasets in the paper.

Overall Structure

More models results on KITTI and nuScenes datasets will be released soon.

Installation and Usage

The code is developed with MMDetection3D v0.6.1 and works well with v0.14.0.

Dependencies

Installation

  1. Install dependencies following their guidelines.
  2. Clone and install mmdet3d in develop mode.
git clone https://github.com/open-mmlab/mmdetection3d.git
cd mmdetection3d
python setup.py develop
  1. Add the files in this repo into the directories in mmdet3d.

Training and Testing

Download the pretrained weights from Google Drive or Tsinghua Cloud and put them in the checkpoints folder. Use votenet_ptr_sunrgbd-3d-10class as an example:

# Training
bash -x tools/dist_train.sh configs/pointformer/votenet_ptr_sunrgbd-3d-10class.py 8

# Testing 
bash tools/dist_test.sh configs/pointformer/votenet_ptr_sunrgbd-3d-10class.py checkpoints/votenet_ptr_sunrgbd-3d-10class.pth 8 --eval mAP

Results

SUN RGB-D

classesAP_0.25AR_0.25AP_0.50AR_0.50
bed0.83430.95150.55560.7029
table0.53530.87050.23440.4604
sofa0.65880.91710.49790.6715
chair0.76810.87000.56640.6703
toilet0.91170.99310.55380.7103
desk0.24580.80500.07540.3395
dresser0.36260.80280.23570.4908
night_stand0.67010.90200.45250.6196
bookshelf0.33830.68090.09680.2624
bathtub0.78210.89800.42590.5510
Overall0.61070.86910.36940.5479

ScanNet

classesAP_0.25AR_0.25AP_0.50AR_0.50
cabinet0.45480.79300.17570.4435
bed0.88390.95060.80060.8889
chair0.90110.93860.75620.8136
sofa0.89150.97940.66190.8041
table0.67630.87140.48580.6971
door0.54130.72160.21070.4283
window0.48210.70210.15040.2979
bookshelf0.52550.87010.44220.7273
picture0.18150.36490.07480.1351
counter0.62100.86540.23330.3846
desk0.68590.93700.37740.6535
curtain0.55220.79100.31560.4627
refrigerator0.52150.96490.40280.7193
showercurtrain0.67090.96430.19410.5000
toilet0.99221.00000.82100.8793
sink0.63610.73470.41190.5000
bathtub0.87100.87100.83750.8387
garbagebin0.47620.72640.22440.4604
Overall0.64250.83590.42090.5908

For more details of experimetns please refer to the paper.

Acknowledgement

This code is based on MMDetection3D.

Citation

If you find our work is useful in your research, please consider citing:

@InProceedings{Pan_2021_CVPR,
    author    = {Pan, Xuran and Xia, Zhuofan and Song, Shiji and Li, Li Erran and Huang, Gao},
    title     = {3D Object Detection With Pointformer},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {7463-7472}
}