Home

Awesome

Novel Class Discovery for 3D Point Cloud Semantic Segmentation [CVPR 2023]

The official implementation of our work "Novel Class Discovery for 3D Point Cloud Semantic Segmentation".

teaser

Introduction

Novel class discovery (NCD) for semantic segmentation is the problem of learning a model that is capable of segmenting unlabelled (novel) classes by using only the supervision from labelled (base) classes. This problem has been recently pioneered for 2D image data, but no work exists for 3D point cloud data. In fact, assumptions made for 2D are loosely applicable to 3D in this case. This paper is thus presented to advance the state of the art on point cloud data analysis in four directions. Firstly, we address the new problem of NCD for point cloud semantic segmentation. Secondly, we show that the transposition of the only existing NCD method for 2D semantic segmentation to 3D data is suboptimal. Thirdly, we present a new method for NCD based on online clustering that exploits uncertainty quantification to produce prototypes for pseudo-labelling the points of the novel classes. Lastly, we introduce a new evaluation protocol to assess the performance of NCD for point cloud semantic segmentation. We thoroughly evaluate our method on SemanticKITTI and SemanticPOSS datasets, showing that it can significantly outperform the baseline.

:fire: For more information have a look at our PAPER! :fire:

Authors: Luigi Riz, Cristiano Saltori, Elisa Ricci, Fabio Poiesi

News :new:

Installation

The code has been tested with Python 3.8, CUDA 11.3, pytorch 1.10.1 and pytorch-lighting 1.4.8. Any other version may require to update the code for compatibility.

Conda

To run the code, you need to install:

Data preparation

To download the data follow the instructions provided by SemanticKITTI and SemanticPOSS. Then, use this structure of the folders:

./
├── 
├── ...
└── path_to_data_shown_in_yaml_config/
      └── sequences
            ├── 00/           
            │   ├── velodyne/	
            |   |	   ├── 000000.bin
            |   |	   ├── 000001.bin
            |   |	   └── ...
            │   └── labels/ 
            |          ├── 000000.label
            |          ├── 000001.label
            |          └── ...
            └── ...

Commands

Pretraining

To run the pretraining:

python main_pretrain.py -s [SPLIT NUMBER] --dataset [SemanticPOSS, SemanticKITTI]

For additional command line arguments, run:

python main_pretrain.py -h

Discovery

To run the discovery step (pretraining is not mandatory):

python main_discover.py -s [SPLIT NUMBER] --dataset [SemanticPOSS, SemanticKITTI]

For additional command line arguments, run:

python main_discover.py -h

To reproduce the paper results run:

python main_discover.py -s [SPLIT NUMBER] --dataset SemanticPOSS --dataset_config [CONFIG_PATH] --num_heads=5 --overcluster_factor=3 --use_scheduler --adapting_epsilon_sk --use_uncertainty_queue --use_uncertainty_loss --uncertainty_percentile=0.3
python main_discover.py -s [SPLIT NUMBER] --dataset SemanticKITTI --dataset_config [CONFIG_PATH] --num_heads=5 --overcluster_factor=3 --use_scheduler --adapting_epsilon_sk --use_uncertainty_queue --use_uncertainty_loss --uncertainty_percentile=0.5

To test with a checkpoint, run:

python main_discover_test.py -s [SPLIT NUMBER] --checkpoint=[PATH TO LIGHTNING CHECKPOINT]

Citing our work

Please cite the following paper if you use our code:

@inproceedings{riz2023novel,
  author={Riz, Luigi and Saltori, Cristiano and Ricci, Elisa and Poiesi, Fabio},
  booktitle={2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, 
  title={Novel Class Discovery for 3D Point Cloud Semantic Segmentation}, 
  year={2023},
  volume={},
  number={},
  pages={9393-9402},
  doi={10.1109/CVPR52729.2023.00906}}

Acknowledgements

This project has received funding from the European Union’s Horizon Europe research and innovation programme under grant agreement No 101058589. This work was also partially supported by the PRIN project LEGO-AI (Prot. 2020TA3K9N), the EU ISFP PROTECTOR (101034216) project and the EU H2020 MARVEL (957337) project and, it was carried out in the Vision and Learning joint laboratory of Fondazione Bruno Kessler and UNITN.

TODOS :soon: