Home

Awesome

Towards Efficient 3D Object Detection with Knowledge Distillation


Jihan Yang, Shaoshuai Shi, Runyu Ding, Zhe Wang, Xiaojuan Qi

This repository contains the official implementation of Towards Efficient 3D Object Detection with Knowledge Distillation, NeurIPS 2022

Changelog

[2022-09-] Code release for Waymo results.

Introduction

Our code is based on OpenPCDet v0.5.2. More updates on OpenPCDet are supposed to be compatible with our code.

Model Zoo

Waymo Open Dataset

Basic Models without KD

Similar to OpenPCDet, all models in the following table are trained with a single frame of 20% data (~32k frames) of all the training samples on 8 GTX 1080Ti GPUs. Note that the validation are also carried on the 20% validation set with DATA_CONFIG.SAMPLED_INTERVAL.test 5 (similar performance with 100% validation set).

Model (20% data)LEVEL2 mAPHFlops (G)Acts (M)Latency (ms)
CP-Voxel64.29114.8101.9125.70
CP-Voxel-S62.2347.865.797.99
CP-Voxel-XS61.1436.958.488.19
CP-Voxel-XXS56.2612.033.170.44
CP-Pillar59.09333.9303.0157.90
CP-Pillar-v0.457.55212.9197.7103.37
CP-Pillar-v0.4856.27149.4142.381.87
CP-Pillar-v0.6452.8185.188.054.52

Sparse Distillation Models

Model (20% data)LEVEL2 mAPHGains
CP-Voxel-S + SparseKD64.25+2.0
CP-Voxel-XS + SparseKD63.53+2.4
CP-Voxel-XXS + SparseKD59.28+3.0
CP-Pillar-v0.4 + SparseKD59.24+1.7
CP-Pillar-v0.48 + SparseKD58.53+2.3
CP-Pillar-v0.64 + SparseKD55.82+3.0

Here we also provide the performance of several models trained on the full training set and validate on the full validation set.

Model (100% data)LEVEL2 mAPH
CP-Voxel65.58
CP-Pillar61.56
PV-RCNN++69.46
CP-Voxel-S + SparseKD65.75
CP-Voxel-XS + SparseKD64.83
CP-Voxel-XXS + SparseKD60.93
CP-Pillar-v0.4 + SparseKD61.60
CP-Pillar-v0.48 + SparseKD60.95
CP-Pillar-v0.64 + SparseKD58.89

Cross stage distillation

Model (20% data)LEVEL2 mAPHFlops (G)Acts (M)Latency (ms)
PV-RCNN++67.80123.5179.7435.9
CP-Voxel64.29114.8101.9125.7
CP-Voxel + SparseKd65.27114.8101.9125.7

We could not publicly provide the above pretrained models due to Waymo Dataset License Agreement. To access these pretrained models, please email us your name, institute, a screenshot of the Waymo dataset registration confirmation mail, and your intended usage. Please send a second email if we don't get back to you in two days. Please note that Waymo open dataset is under strict non-commercial license, so we are not allowed to share the model with you if it will use for any profit-oriented activities.

Latency on Different Hardware

latency

To facilitate further researches, we provide latency measured in milliseconds as follows:

Model1060 + Spconv2.x1080Ti + Spconv2.xA100 + Spconv2.x
SECOND84.5650.6540.44
PointPillar129.1268.6144.14
CP-Voxel125.7074.2556.13
CP-Voxel-S97.9965.6839.40
CP-Voxel-XS88.1961.4136.65
CP-Voxel-XXS70.4453.8638.01
CP-Pillar157.9077.2951.94
CP-Pillar-v0.4103.3756.3257.07
CP-Pillar-v0.4881.8749.6427.44
CP-Pillar-v0.6454.5236.2319.56

Installation

Please refer to INSTALL.md for the installation of OpenPCDet.

Getting Started

Please refer to GETTING_STARTED.md to learn more usage about this project.

License

Our code is released under the Apache 2.0 license.

Acknowledgement

Our code is heavily based on OpenPCDet. Thanks OpenPCDet Development Team for their awesome codebase.

Citation

If you find this project useful in your research, please consider cite:

@inproceedings{yang2022towards,
    title={Towards Efficient 3D Object Detection with Knowledge Distillation},
    author={Yang, Jihan and Shi, Shaoshuai and Ding, Runyu and Wang, Zhe and Qi, Xiaojuan},
    booktitle={Advances in Neural Information Processing Systems},
    year={2022}
}
@misc{openpcdet2020,
    title={OpenPCDet: An Open-source Toolbox for 3D Object Detection from Point Clouds},
    author={OpenPCDet Development Team},
    howpublished = {\url{https://github.com/open-mmlab/OpenPCDet}},
    year={2020}
}