Awesome

DAPS3D: Domain Adaptive Projective Segmentation of 3D LiDAR Point Clouds

Alexey Klokov, Di Un Pak, Aleksandr Khorin, Dmitry Yudin, Leon Kochiev, Vladimir Luchinskiy, Vitaly Bezuglyj

[pdf] [BibTeX]

This repo contains the code for our paper DAPS3D: Domain Adaptive Projective Segmentation of 3D LiDAR Point Clouds.

Abstract

LiDARs are one of the key sources of reliable environmental ranging information for autonomous vehicles. However, segmentation of 3D scene elements (roads, buildings, people, cars, etc.) based on LiDAR point clouds has limitations. On the one hand, point- and voxel-based segmentation neural networks do not offer sufficiently high speed. On the other hand, modern labeled datasets primarily consist of street scenes recorded for driverless cars and contain little data for mobile delivery robots or cleaners that must work in parks and yards with heavy pedestrian traffic. This article aims to overcome these limitations. We have proposed a novel approach called DAPS3D to train deep neural networks for 3D semantic segmentation. This approach is based on a spherical projection of a point cloud and LiDAR-specific masks, enabling the model to adapt to different types of LiDAR. First of all, we have introduced various high-speed multi-scale spherical projection segmentation models, including convolutional, recurrent, and transformer architectures. Among them, the SalsaNextRecLSTM model has shown the best results. Secondly, we have proposed several original augmentations for spherical projections of LiDAR data, including FoV, flip, and rotation augmentation, as well as a special T-Zone cutout. These augmentations increase the model's invariance when dealing with changes in the data domain. Finally, we introduce a new method to generate synthetic datasets for domain adaptation problems. We have developed two new datasets for validating 3D scene outdoor segmentation algorithms: the DAPS-1 dataset, which is based on the augmentation of the reconstructed 3D semantic map, and the DAPS-2 LiDAR dataset, collected by the on-board sensors of a cleaning robot in a park area. Particular attention is given to the performance of the developed models, demonstrating their ability to function in real-time.

SalsaNetRecAbstract RecBlock

ToDo List

Installation Instructions

We use conda environments
We use different Python and PyTorch versions for our models
For complete installation instructions, please see Installation.

Dataset Preparation

We release our DAPS-1 and DAPS-2 datasets.
We also experiment on SemanticKITTI and RELLIS-3D datasets.
Please see Datasets Preparation for complete instructions for preparing the datasets.

Execution Instructions

Training

We train all our models using 2 × RTX 2080Ti (11 GB each) GPUs.
Please see Getting Started for training commands.

Evaluation

Please see Getting Started for evaluation commands.

Results

You can find our pretrained models in Getting Started.

DAPS-1

DAPS-1_Results

All models are trained with augmentation set 5 with T-Zone (see our Paper)

Model	$mIoU$, %	$IoU$[vehicle], %	$IoU$[human], %	$IoU$[surface], %	$IoU$[obstacle], %
SalsaNet	0.867	<ins>0.880</ins>	0.646	0.989	0.953
SalsaNetRec	0.850	0.836	0.680	0.974	0.908
SalsaNetRecLSTM	0.862	0.878	0.632	0.988	0.949
SalsaNext	<ins>0.886</ins>	0.878	<ins>0.721</ins>	<ins>0.990</ins>	<ins>0.954</ins>
SalsaNextRecLSTM	0.932	0.929	0.830	0.994	0.974
DDRNet	0.690	0.773	0.126	0.977	0.886
DDRNetOC	0.694	0.769	0.138	0.978	0.889
DDRNetDA	0.691	0.770	0.129	0.977	0.886
Segformer	0.530	0.495	0.052	0.909	0.665

DAPS-2

DAPS-2_Domain

These are inference results on DAPS-2 for the models trained on different datasets.

Models trained on SemanticKITTI

Model	$mIoU$, %	$IoU$[human], %	$IoU$[surface], %	$IoU$[obstacle], %
SalsaNet	0.405	0.056	0.662	0.498
SalsaNetRec	0.267	0.034	0.478	0.290
SalsaNetRecLSTM	0.338	0.075	0.565	0.373
SalsaNext	0.258	0.074	0.423	0.277
SalsaNextRecLSTM	0.327	0.039	0.548	<ins>0.394</ins>
DDRNet	0.345	<ins>0.080</ins>	0.768	0.187
DDRNetOC	0.323	0.015	<ins>0.739</ins>	0.216
DDRNetDA	<ins>0.369</ins>	0.040	0.705	0.361
Segformer	0.230	0.166	0.182	0.343

Models trained on SemanticKITTI + RELLIS-3D

Model	$mIoU$, %	$IoU$[human], %	$IoU$[surface], %	$IoU$[obstacle], %
SalsaNet	<ins>0.712</ins>	0.733	0.760	<ins>0.643</ins>
SalsaNetRec	0.481	0.446	0.590	0.406
SalsaNetRecLSTM	0.689	0.770	<ins>0.761</ins>	0.537
SalsaNext	0.663	0.799	0.684	0.505
SalsaNextRecLSTM	0.736	0.813	0.784	<ins>0.612</ins>
DDRNet	0.580	0.697	0.535	0.509
DDRNetOC	0.616	0.724	0.586	0.539
DDRNetDA	0.541	0.614	0.507	0.503
Segformer	0.290	0.412	0.054	0.404

Models trained on DAPS-1

Model	$mIoU$, %	$IoU$[human], %	$IoU$[surface], %	$IoU$[obstacle], %
SalsaNet	0.575	0.479	0.741	0.504
SalsaNetRec	0.609	0.596	<ins>0.748</ins>	0.482
SalsaNetRecLSTM	0.624	0.532	0.783	0.557
SalsaNext	<ins>0.643</ins>	<ins>0.706</ins>	0.620	<ins>0.602</ins>
SalsaNextRecLSTM	0.759	0.709	0.746	0.823
DDRNet	0.516	0.354	0.737	0.458
DDRNetOC	0.562	0.403	0.705	0.577
DDRNetDA	0.531	0.360	0.718	0.515
Segformer	0.393	0.355	0.418	0.407

Citation

If you found DAPS3D useful in your research, please consider starring ⭐ us on GitHub and citing 📚 us in your research!

@article{klokov2023daps3d,
  title={DAPS3D: Domain Adaptive Projective Segmentation of 3D LiDAR Point Clouds},
  author={Klokov, Alexey and Pak, Di Un and Khorin, Aleksandr and Yudin, Dmitry and Kochiev, Leon and Luchinskiy, Vladimir and Bezuglyj, Vitaly},
  journal={IEEE Access},
  year={2023},
  doi={10.1109/ACCESS.2023.3298706},
  publisher={IEEE}
}

Licence

This repository is released under MIT License (see LICENSE file for details).