Awesome
CMKD: Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (ECCV 2022 Oral)
Paper
Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (arXiv, Supplimentary Included)
Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (ECCV Open Access)
Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (Waymo Challenge Report)
If you find our papers helpful for your research, you may cite our paper as
@inproceedings{yuhong-CMKD-ECCV2022,
author = {Yu Hong and
Hang Dai and
Yong Ding},
title = {Cross-Modality Knowledge
Distillation Network for Monocular 3D Object
Detection},
booktitle = {{ECCV}},
series = {Lecture Notes in Computer Science},
publisher = {Springer},
year = {2022}
}
Introduction
This is the official implementation of CMKD with OpenPCDet for KITTI and Waymo datasets.
We have also implemented another version with MMDetection3D for Nuscenes dataset.
News
[2023.3.25] We have several updates.
- Release the multi-camera version CMKD for Nuscenes dataset, refer to this link for detail
- Release more pre-trained models for KITTI dataset
[2023.2.14] We have several updates.
Notice: Due to the short schedule, instructions and pre-trained models will be released gradually in the near future, and there may be many issues and bugs, please feel free to let us know if you have any questions.
- Support center head in response distillation
- Support more teacher models in the framework (We now have SECOND, CenterPoint and PointPillar), more pre-trained models will be released later
- Support to set different feature level as the distillation guidance
- Add visualization utils to visualize the BEV feature maps and the detection results
- Support for Nuscenes dataset will be released very soon
[2022.11.20] We release some instructions and pre-trained models covering the KITTI experiments.
This implementation has some differences from our paper, but the core idea is the same. Overall, the current version is faster to train, uses less memory, and has similar performance to the older version.
Waymo experiments and Nuscenes experiments are on the way.
[2022.7.9] Our paper has been accepted by ECCV 2022 as Oral presentation. :fire::fire::fire:
[2022.7.4] Our paper has been accepted by ECCV 2022. :fire::fire:
[2022.5.24] CMKD gets the 3rd place in the 2022 Waymo 3D camera-only detection challenge. :fire:
In the challenge, we simply extend our baseline model from single-camera version to multi-camera version without any challenge-specific skills and achieve good results. Specifically, we use a lightweight res-50 backbone with 20% of the total training samples, no previous frames, no data augmentation, and no training and testing tricks to rank 3rd in the challenge.
Framework Overview
BEV Features Generation
Use CMKD
Installation
Please follow INSTALL to install CMKD.
Getting Started
Please follow GETTING_START to train or evaluate the models.
Models
KITTI
Teacher Model | Car Easy@R40 | Car Moderate@R40 | Car Hard@R40 | Model | Teacher Model | |
---|---|---|---|---|---|---|
CMKD-R50 (kitti train + eigen clean) | SECOND | 33.36 | 21.61 | 17.97 | model | model |
CMKD-R50 (kitti train) | SECOND | 24.02 | 15.80 | 13.22 | model | model |
CMKD-R50 (kitti train + eigen clean) | CenterPoint | 29.78 | 21.17 | 18.41 | model | model |
CMKD-R50 (kitti train) | CenterPoint | 22.56 | 16.02 | 13.52 | model | model |
CMKD-R50 (kitti train + eigen clean) | PointPillar | 32.25 | 21.47 | 18.21 | model | model |
CMKD-R50 (kitti train) | PointPillar | 23.84 | 16.44 | 13.58 | model | model |
Waymo
Coming Soon
Nuscenes
mAP | NDS | Model | |
---|---|---|---|
BEVDet-R50 | 30.7 | 38.2 | - |
BEVDet-R50 + CMKD | 34.7 | 42.6 | - |