Awesome

CVPR 2024 | 3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labelling

<a href='[https://jiangchaokang.github.io/3DSFLabelling-Page/](https://jiangchaokang.github.io/3DSFLabelling-Page/)' style='padding-left: 0.5rem;'> <img src='https://img.shields.io/badge/Project-Page-blue?style=flat&logo=Google%20chrome&logoColor=blue' alt='Project Page'> </a>

GitHub stars GitHub contributors GitHub issues Downloads

Check out the project demo here: 3DSFLabelling-Page

The code is gradually being released, please be patient.

[poster coming soon] [video coming soon]

Description	Simple Example of the Auto-Labelling
The proposed 3D scene flow pseudo-auto-labelling framework. Given point clouds and initial bounding boxes, both global and local motion parameters are iteratively optimized. Diverse motion patterns are augmented by randomly adjusting these motion parameters, thereby creating a diverse and realistic set of motion labels for the training of flow estimation models.

</div>

Highlights <a name="highlights"></a>

:fire: GENERATE FULLY ALIGNED INTER-FRAME POINT CLOUDS

We propose a new framework for the automatic labelling of 3D scene flow pseudo-labels, significantly enhancing the accuracy of current scene flow estimation models, and effectively addressing the scarcity of 3D flow labels in autonomous driving.

:fire

:star2: Plug-and-play & Novel motion augmentation

We propose a universal 3D box optimization method with multiple motion attributes. Building upon this, we further introduce a plug-and-play 3D scene flow augmentation module with global-local motions and motion status. This allows for flexible motion adjustment of ego-motion and dynamic environments, setting a new benchmark for scene flow data augmentation.

News <a name="news"></a>

[2024/4] :fire: Open source example model (GMSF, MSBRN and FLOT), Training and evaluation code.
[2024/4] :fire: Open sourced a new data set with 3D scene flow GT, The data comes from KITTI Odometry Dataset, Argoverse Dataset and nuScenes Dataset respectively.
[2024/4] :fire: Open sourced the code for the 3D Scene Flow Label Generation
[2024/03] 3DSFLabelling code and models initially released.
[2024/02] 3DSFLabelling is accepted by CVPR 2024.
[2024/02] 3DSFLabelling paper released.

TODO List <a name="TODO List"></a>

Still in progress:

Datasets are easier to use.
The validity of the generated labels is verified on motion segmentation and LiDAR odometry.
Readability optimization of configuration files and data reading code sections.

Results and Model Zoo
License and Citation
Comparative Results

Results and Model Zoo <a name="models"></a>

Method	Dataset	Pre-trained Model	EPE3D
GMSF	lidarKITTI	gmsf_lidarKITTI_epe0.008.pth	0.008
GMSF	Argoverse	gmsf_argoverse_epe0.013.pth	0.013
GMSF	nuScenes	gmsf_nuScene_epe0.018.pth	0.018
FLOT	lidarKITTI	flot_lidarKITTI_epe0.018.tar	0.018
FLOT	Argoverse	flot_argoverse_epe0.043.tar	0.043
FLOT	nuScenes	flot_nuScenes_epe0.061.tar	0.061
MSBRN	lidarKITTI	msbrn_lidarKITTI_epe0.011.pth	0.0110
MSBRN	Argoverse	msbrn_argoverse_epe0.017.pth	0.017
MSBRN	nuScenes	msbrn_nuScenes_epe0.076.pth	0.076

Comparative results <a name="Comparative"></a>

The comparative results between our method and baseline. "↑" signifies accuracy enhancement. In real-world LiDAR scenarios, our method markedly improves the 3D flow estimation accuracy across three datasets on the three baselines. This demonstrates that the proposed pseudo-auto-labelling framework can substantially boost the accuracy of existing methods, even without the need for ground truth.

Dataset	Method	EPE3D↓	Acc3DS↑	Acc3DR↑
	<span style="background-color:#000000">FLOT [1]</span>	<span style="background-color:#000000">0.6532</span>	<span style="background-color:#000000">0.1554</span>	<span style="background-color:#000000">0.3130</span>
	FLOT+3DSFlabelling	0.0189 ↑97.1%	0.9666	0.9792
	<span style="background-color:#000000">MSBRN [2]</span>	<span style="background-color:#000000">0.0139</span>	<span style="background-color:#000000">0.9752</span>	<span style="background-color:#000000">0.9847</span>
<div style="text-align:left">LiDAR<br>KITTI</div>	MSBRN+3DSFlabelling	0.0123 ↑11.5%	0.9797	0.9868
	<span style="background-color:#000000">GMSF [3]</span>	<span style="background-color:#000000">0.1900</span>	<span style="background-color:#000000">0.2962</span>	<span style="background-color:#000000">0.5502</span>
	GMSF+3DSFlabelling	0.0078 ↑95.8%	0.9924	0.9947

Dataset	Method	EPE3D↓	Acc3DS↑	Acc3DR↑
	<span style="background-color:#000000">FLOT [1]</span>	<span style="background-color:#000000">0.2491</span>	<span style="background-color:#000000">0.0946</span>	<span style="background-color:#000000">0.3126</span>
	FLOT+3DSFlabelling	0.0107 ↑95.7%	0.9711	0.9862
Argoverse	<span style="background-color:#000000">MSBRN [2]</span>	<span style="background-color:#000000">0.8691</span>	<span style="background-color:#000000">0.2432</span>	<span style="background-color:#000000">0.2854</span>
	MSBRN+3DSFlabelling	0.0150 ↑98.3%	0.9482	0.9601
	<span style="background-color:#000000">GMSF [3]</span>	<span style="background-color:#000000">7.2776</span>	<span style="background-color:#000000">0.0036</span>	<span style="background-color:#000000">0.0144</span>
	GMSF+3DSFlabelling	0.0093 ↑99.9%	0.9780	0.9880

Dataset	Method	EPE3D↓	Acc3DS↑	Acc3DR↑
	<span style="background-color:#000000">FLOT [1]</span>	<span style="background-color:#000000">0.4858</span>	<span style="background-color:#000000">0.0821</span>	<span style="background-color:#000000">0.2669</span>
	FLOT+3DSFlabelling	0.0554 ↑88.6%	0.7601	0.8909
nuScenes	<span style="background-color:#000000">MSBRN [2]</span>	<span style="background-color:#000000">0.6137</span>	<span style="background-color:#000000">0.2354</span>	<span style="background-color:#000000">0.2924</span>
	MSBRN+3DSFlabelling	0.0235 ↑96.2%	0.9413	0.9604
	<span style="background-color:#000000">GMSF [3]</span>	<span style="background-color:#000000">9.4231</span>	<span style="background-color:#000000">0.0034</span>	<span style="background-color:#000000">0.0086</span>
	GMSF+3DSFlabelling	0.0185 ↑99.8%	0.9534	0.9713

<a id="1">[1]</a> Puy G, Boulch A, Marlet R. Flot: Scene flow on point clouds guided by optimal transport[C]//European conference on computer vision. Cham: Springer International Publishing, 2020: 527-544.
<a id="2">[2]</a> Cheng W, Ko J H. Multi-scale bidirectional recurrent network with hybrid correlation for point cloud based scene flow estimation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2023: 10041-10050.
<a id="3">[3]</a> Zhang Y, Edstedt J, Wandt B, et al. Gmsf: Global matching scene flow[J]. Advances in Neural Information Processing Systems, 2024, 36.

License and Citation <a name="license-and-citation"></a>

All assets and code are under the Apache 2.0 license unless specified otherwise.

If this work is helpful for your research, please consider citing the following BibTeX entry.

@inproceedings{yang2023vidar,
  title={3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labelling},
  author={Jiang, Chaokang and Wang, Guangming and Liu, Jiuming and Wang, Hesheng and Ma, Zhuang and Liu, Zhenqiang and Liang, Zhujin and Shan, Yi and Du, Dalong},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year={2024}
}