Awesome
DAPS3D: Domain Adaptive Projective Segmentation of 3D LiDAR Point Clouds
Alexey Klokov, Di Un Pak, Aleksandr Khorin, Dmitry Yudin, Leon Kochiev, Vladimir Luchinskiy, Vitaly Bezuglyj
This repo contains the code for our paper DAPS3D: Domain Adaptive Projective Segmentation of 3D LiDAR Point Clouds.
Abstract
LiDARs are one of the key sources of reliable environmental ranging information for autonomous vehicles. However, segmentation of 3D scene elements (roads, buildings, people, cars, etc.) based on LiDAR point clouds has limitations. On the one hand, point- and voxel-based segmentation neural networks do not offer sufficiently high speed. On the other hand, modern labeled datasets primarily consist of street scenes recorded for driverless cars and contain little data for mobile delivery robots or cleaners that must work in parks and yards with heavy pedestrian traffic. This article aims to overcome these limitations. We have proposed a novel approach called DAPS3D to train deep neural networks for 3D semantic segmentation. This approach is based on a spherical projection of a point cloud and LiDAR-specific masks, enabling the model to adapt to different types of LiDAR. First of all, we have introduced various high-speed multi-scale spherical projection segmentation models, including convolutional, recurrent, and transformer architectures. Among them, the SalsaNextRecLSTM model has shown the best results. Secondly, we have proposed several original augmentations for spherical projections of LiDAR data, including FoV, flip, and rotation augmentation, as well as a special T-Zone cutout. These augmentations increase the model's invariance when dealing with changes in the data domain. Finally, we introduce a new method to generate synthetic datasets for domain adaptation problems. We have developed two new datasets for validating 3D scene outdoor segmentation algorithms: the DAPS-1 dataset, which is based on the augmentation of the reconstructed 3D semantic map, and the DAPS-2 LiDAR dataset, collected by the on-board sensors of a cleaning robot in a park area. Particular attention is given to the performance of the developed models, demonstrating their ability to function in real-time.
ToDo List
- Release code
- Document for Installation
- Document for Dataset Preparation
- Document for Testing and Training
- Evaluation
- Release trained models
- Release datasets: DAPS-1 & DAPS-2
- Release SalsaNextRecLSTM model
- Release SalsaNextRecLSTM model
Installation Instructions
- We use conda environments
- We use different Python and PyTorch versions for our models
- For complete installation instructions, please see Installation.
Dataset Preparation
- We release our DAPS-1 and DAPS-2 datasets.
- We also experiment on SemanticKITTI and RELLIS-3D datasets.
- Please see Datasets Preparation for complete instructions for preparing the datasets.
Execution Instructions
Training
- We train all our models using 2 × RTX 2080Ti (11 GB each) GPUs.
- Please see Getting Started for training commands.
Evaluation
- Please see Getting Started for evaluation commands.
Results
You can find our pretrained models in Getting Started.
DAPS-1
All models are trained with augmentation set 5 with T-Zone (see our Paper)
Model | $mIoU$, % | $IoU$[vehicle], % | $IoU$[human], % | $IoU$[surface], % | $IoU$[obstacle], % |
---|---|---|---|---|---|
SalsaNet | 0.867 | <ins>0.880</ins> | 0.646 | 0.989 | 0.953 |
SalsaNetRec | 0.850 | 0.836 | 0.680 | 0.974 | 0.908 |
SalsaNetRecLSTM | 0.862 | 0.878 | 0.632 | 0.988 | 0.949 |
SalsaNext | <ins>0.886</ins> | 0.878 | <ins>0.721</ins> | <ins>0.990</ins> | <ins>0.954</ins> |
SalsaNextRecLSTM | 0.932 | 0.929 | 0.830 | 0.994 | 0.974 |
DDRNet | 0.690 | 0.773 | 0.126 | 0.977 | 0.886 |
DDRNetOC | 0.694 | 0.769 | 0.138 | 0.978 | 0.889 |
DDRNetDA | 0.691 | 0.770 | 0.129 | 0.977 | 0.886 |
Segformer | 0.530 | 0.495 | 0.052 | 0.909 | 0.665 |
<br/><br/>
DAPS-2
These are inference results on DAPS-2 for the models trained on different datasets.
Models trained on SemanticKITTI
Model | $mIoU$, % | $IoU$[human], % | $IoU$[surface], % | $IoU$[obstacle], % |
---|---|---|---|---|
SalsaNet | 0.405 | 0.056 | 0.662 | 0.498 |
SalsaNetRec | 0.267 | 0.034 | 0.478 | 0.290 |
SalsaNetRecLSTM | 0.338 | 0.075 | 0.565 | 0.373 |
SalsaNext | 0.258 | 0.074 | 0.423 | 0.277 |
SalsaNextRecLSTM | 0.327 | 0.039 | 0.548 | <ins>0.394</ins> |
DDRNet | 0.345 | <ins>0.080</ins> | 0.768 | 0.187 |
DDRNetOC | 0.323 | 0.015 | <ins>0.739</ins> | 0.216 |
DDRNetDA | <ins>0.369</ins> | 0.040 | 0.705 | 0.361 |
Segformer | 0.230 | 0.166 | 0.182 | 0.343 |
Models trained on SemanticKITTI + RELLIS-3D
Model | $mIoU$, % | $IoU$[human], % | $IoU$[surface], % | $IoU$[obstacle], % |
---|---|---|---|---|
SalsaNet | <ins>0.712</ins> | 0.733 | 0.760 | <ins>0.643</ins> |
SalsaNetRec | 0.481 | 0.446 | 0.590 | 0.406 |
SalsaNetRecLSTM | 0.689 | 0.770 | <ins>0.761</ins> | 0.537 |
SalsaNext | 0.663 | 0.799 | 0.684 | 0.505 |
SalsaNextRecLSTM | 0.736 | 0.813 | 0.784 | <ins>0.612</ins> |
DDRNet | 0.580 | 0.697 | 0.535 | 0.509 |
DDRNetOC | 0.616 | 0.724 | 0.586 | 0.539 |
DDRNetDA | 0.541 | 0.614 | 0.507 | 0.503 |
Segformer | 0.290 | 0.412 | 0.054 | 0.404 |
Models trained on DAPS-1
Model | $mIoU$, % | $IoU$[human], % | $IoU$[surface], % | $IoU$[obstacle], % |
---|---|---|---|---|
SalsaNet | 0.575 | 0.479 | 0.741 | 0.504 |
SalsaNetRec | 0.609 | 0.596 | <ins>0.748</ins> | 0.482 |
SalsaNetRecLSTM | 0.624 | 0.532 | 0.783 | 0.557 |
SalsaNext | <ins>0.643</ins> | <ins>0.706</ins> | 0.620 | <ins>0.602</ins> |
SalsaNextRecLSTM | 0.759 | 0.709 | 0.746 | 0.823 |
DDRNet | 0.516 | 0.354 | 0.737 | 0.458 |
DDRNetOC | 0.562 | 0.403 | 0.705 | 0.577 |
DDRNetDA | 0.531 | 0.360 | 0.718 | 0.515 |
Segformer | 0.393 | 0.355 | 0.418 | 0.407 |
Citation
If you found DAPS3D useful in your research, please consider starring ⭐ us on GitHub and citing 📚 us in your research!
@article{klokov2023daps3d,
title={DAPS3D: Domain Adaptive Projective Segmentation of 3D LiDAR Point Clouds},
author={Klokov, Alexey and Pak, Di Un and Khorin, Aleksandr and Yudin, Dmitry and Kochiev, Leon and Luchinskiy, Vladimir and Bezuglyj, Vitaly},
journal={IEEE Access},
year={2023},
doi={10.1109/ACCESS.2023.3298706},
publisher={IEEE}
}
Licence
This repository is released under MIT License (see LICENSE file for details).