Table of contents
  1. Features
  2. Datasets
  3. Installation
  4. Data Preparation
  5. Training and Testing
  6. Quick Demo
  7. Visualization
  8. Acknowledgements
  9. Contacts

ISBNet: a 3D Point Cloud Instance Segmentation Network with Instance-aware Sampling and Box-aware Dynamic Convolution

<a href="https://arxiv.org/abs/2303.00246"><img src="https://img.shields.io/badge/https%3A%2F%2Farxiv.org%2Fabs%2F2303.00246-arxiv-brightgreen"></a>

Tuan Duc Ngo, Binh-Son Hua, Khoi Nguyen<br> VinAI Research, Vietnam

Abstract: Existing 3D instance segmentation methods are dominated by a bottom-up design: a manually fine-tuned algorithm groups points into clusters, and a refinement network then processes them. Because they rely on the quality of the clusters, these methods produce unreliable results when (1) nearby objects of the same semantic class are packed together, or (2) large objects have complex shapes. To address these shortcomings, we introduce ISBNet, a novel cluster-free method that represents instances as kernels and decodes instance masks via dynamic convolution. To efficiently generate a high-recall and discriminative kernel set, we propose a simple strategy, named Instance-aware Farthest Point Sampling, to sample candidates, and we leverage the point aggregation layer adopted from PointNet++ to encode candidate features. Moreover, we show that training 3D instance segmentation in a multi-task learning setting with an additional axis-aligned bounding box prediction head further boosts performance. Our method sets new state-of-the-art results on ScanNetV2 (55.9), S3DIS (60.8), and STPLS3D (49.2) in terms of AP and retains a fast inference time (237 ms per scene on ScanNetV2).
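For readers unfamiliar with the sampling step mentioned in the abstract, the following is a minimal sketch of plain farthest point sampling (FPS) in pure Python. It is not the instance-aware variant proposed in the paper and is not taken from this repository; the function name `farthest_point_sampling` and its parameters are illustrative assumptions.

```python
import math


def farthest_point_sampling(points, num_samples, start_index=0):
    """Plain FPS over a list of (x, y, z) tuples; returns sampled indices.

    Illustrative only: this is standard FPS, not the Instance-aware
    Farthest Point Sampling described in the paper.
    """
    if not points or num_samples <= 0:
        return []
    num_samples = min(num_samples, len(points))
    selected = [start_index]
    # min_dist[i] is the distance from point i to its nearest selected point.
    min_dist = [math.dist(p, points[start_index]) for p in points]
    while len(selected) < num_samples:
        # Pick the point that is farthest from the current sample set.
        next_index = max(range(len(points)), key=min_dist.__getitem__)
        selected.append(next_index)
        # Update nearest-sample distances with the newly selected point.
        for i, p in enumerate(points):
            d = math.dist(p, points[next_index])
            if d < min_dist[i]:
                min_dist[i] = d
    return selected
```

Each iteration adds the point with the largest distance to the already selected set, so the samples spread out over the scene rather than concentrating in dense regions.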

Figure: overview of ISBNet.

Details of the model architecture and experimental results can be found in our paper:

@inproceedings{ngo2023isbnet,
 author={Tuan Duc Ngo and Binh-Son Hua and Khoi Nguyen},
 booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
 title={ISBNet: a 3D Point Cloud Instance Segmentation Network with Instance-aware Sampling and Box-aware Dynamic Convolution},
 year= {2023}
}

Please CITE our paper whenever this repository is used to help produce published results or is incorporated into other software.

Features :mega:

Datasets :floppy_disk:

Installation :memo:

Please refer to the installation guide.

Data Preparation :hammer:

Please refer to the data preparation guide.

Training and Testing :train2:

Please refer to the training guide.

Quick Demo :fire:

ScanNetv2

| Dataset | AP | AP_50 | Config | Checkpoint |
|---|---|---|---|---|
| ScanNet test | 55.9 | 76.3 | | |
| ScanNet val (paper) | 54.5 | 73.1 | | |
| ScanNet val | 56.8 | 73.3 | config | checkpoint |
| ScanNet val (lightweight) | 50.1 | 68.9 | config | checkpoint |

ScanNetv2-200

| Dataset | AP | AP_50 | Config | Checkpoint |
|---|---|---|---|---|
| ScanNet200 val | 24.5 | 32.7 | config | checkpoint |

S3DIS

| Dataset | AP | AP_50 | Config | Checkpoint |
|---|---|---|---|---|
| Area 5 | 56.3 | 67.5 | config | checkpoint |

STPLS3D

| Dataset | AP | AP_50 | Config | Checkpoint |
|---|---|---|---|---|
| STPLS3D val | 51.2 | 66.7 | config | checkpoint |

Run evaluation with pre-trained models:

python3 tools/test.py <path_to_config_file> <path_to_pretrain_weight>
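If you want to script evaluation over several config/checkpoint pairs, the command above can be wrapped in a small helper. The sketch below is not part of the repository: `build_test_command` and `run_evaluation` are hypothetical names, and it assumes the script is run from the repository root so that `tools/test.py` resolves. It passes only the two positional arguments shown above.

```python
import subprocess
from pathlib import Path


def build_test_command(config_path, checkpoint_path, python_executable="python3"):
    """Return the argv list for the evaluation command shown above."""
    return [python_executable, "tools/test.py", str(config_path), str(checkpoint_path)]


def run_evaluation(config_path, checkpoint_path):
    """Check that both inputs exist, then launch tools/test.py.

    Hypothetical helper: assumes the current working directory is the
    repository root.
    """
    for path in (config_path, checkpoint_path):
        if not Path(path).is_file():
            raise FileNotFoundError(f"Missing input file: {path}")
    command = build_test_command(config_path, checkpoint_path)
    # check=True raises CalledProcessError if the evaluation script fails.
    return subprocess.run(command, check=True)
```

Call `run_evaluation(<path_to_config_file>, <path_to_pretrain_weight>)` with the config and checkpoint downloaded from the tables above.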

Visualization :computer:

Please refer to the visualization guide. Qualitative results of our method are available here.

Acknowledgements :clap:

This repo is built upon SpConv, DyCo3D, SSTNet, and SoftGroup.

Contacts :email:

If you have any questions or suggestions about this repo, please feel free to contact me (ductuan.ngo99@gmail.com).