<p align="right">English | <a href="./README_CN.md">简体中文</a></p> <p align="center"> <img src="docs/figs/logo.png" align="center" width="22.5%"> <h3 align="center"><strong>Robo3D: Towards Robust and Reliable 3D Perception against Corruptions</strong></h3> <p align="center"> <a href="https://scholar.google.com/citations?user=-j1j7TkAAAAJ" target='_blank'>Lingdong Kong</a><sup>1,2,*</sup> <a href="https://github.com/youquanl" target='_blank'>Youquan Liu</a><sup>1,3,*</sup> <a href="https://scholar.google.com/citations?user=7atts2cAAAAJ" target='_blank'>Xin Li</a><sup>1,4,*</sup> <a href="https://scholar.google.com/citations?user=Uq2DuzkAAAAJ" target='_blank'>Runnan Chen</a><sup>1,5</sup> <a href="https://scholar.google.com/citations?user=QDXADSEAAAAJ" target='_blank'>Wenwei Zhang</a><sup>1,6</sup> <br> <a href="https://scholar.google.com/citations?user=YUKPVCoAAAAJ" target='_blank'>Jiawei Ren</a><sup>6</sup> <a href="https://scholar.google.com/citations?user=lSDISOcAAAAJ" target='_blank'>Liang Pan</a><sup>6</sup> <a href="https://scholar.google.com/citations?user=eGD0b7IAAAAJ" target='_blank'>Kai Chen</a><sup>1</sup> <a href="https://scholar.google.com/citations?user=lc45xlcAAAAJ" target='_blank'>Ziwei Liu</a><sup>6</sup> <br> <sup>1</sup>Shanghai AI Laboratory <sup>2</sup>National University of Singapore <sup>3</sup>Hochschule Bremerhaven <sup>4</sup>East China Normal University <sup>5</sup>The University of Hong Kong <sup>6</sup>S-Lab, Nanyang Technological University </p> </p> <p align="center"> <a href="https://arxiv.org/abs/2303.17597" target='_blank'> <img src="https://img.shields.io/badge/Paper-%F0%9F%93%83-slategray"> </a> <a href="https://ldkong.com/Robo3D" target='_blank'> <img src="https://img.shields.io/badge/Project-%F0%9F%94%97-lightblue"> </a> <a href="" target='_blank'> <img src="https://img.shields.io/badge/Demo-%F0%9F%8E%AC-pink"> </a> <a href="https://zhuanlan.zhihu.com/p/672935761" target='_blank'> <img src="https://img.shields.io/badge/%E4%B8%AD%E8%AF%91%E7%89%88-%F0%9F%90%BC-red"> </a> <a href="" target='_blank'> <img src="https://visitor-badge.laobi.icu/badge?page_id=ldkong1205.Robo3D&left_color=gray&right_color=firebrick"> </a> </p>About
Robo3D is an evaluation suite for robust and reliable 3D perception in autonomous driving. With it, we probe the robustness of 3D detectors and segmentors under out-of-distribution (OoD) scenarios, i.e., against corruptions that occur in real-world environments. Specifically, we consider natural corruptions that arise in the following cases:
- Adverse weather conditions, such as `fog`, `wet ground`, and `snow`;
- External disturbances that are caused by `motion blur` or result in LiDAR `beam missing`;
- Internal sensor failure, including `crosstalk`, possible `incomplete echo`, and `cross-sensor` scenarios.
<img src="docs/figs/teaser/clean.png" width="240"> | <img src="docs/figs/teaser/fog.png" width="240"> | <img src="docs/figs/teaser/wet_ground.png" width="240"> |
Clean | Fog | Wet Ground |
<img src="docs/figs/teaser/snow.png" width="240"> | <img src="docs/figs/teaser/motion_blur.png" width="240"> | <img src="docs/figs/teaser/beam_missing.png" width="240"> |
Snow | Motion Blur | Beam Missing |
<img src="docs/figs/teaser/crosstalk.png" width="240"> | <img src="docs/figs/teaser/incomplete_echo.png" width="240"> | <img src="docs/figs/teaser/cross_sensor.png" width="240"> |
Crosstalk | Incomplete Echo | Cross-Sensor |
Visit our project page to explore more examples. :oncoming_automobile:
## Updates
- [2024.05] - Check out the technical report of this competition: The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition :blue_car:.
- [2024.05] - The slides of the 2024 RoboDrive Workshop are available here :arrow_heading_up:.
- [2024.05] - The video recordings are available on YouTube :arrow_heading_up: and Bilibili :arrow_heading_up:.
- [2024.05] - We are glad to announce the winning teams of the 2024 RoboDrive Challenge:
  - Track 1: Robust BEV Detection
    - :1st_place_medal: `DeepVision`, :2nd_place_medal: `Ponyville Autonauts Ltd`, :3rd_place_medal: `CyberBEV`
  - Track 2: Robust Map Segmentation
    - :1st_place_medal: `SafeDrive-SSR`, :2nd_place_medal: `CrazyFriday`, :3rd_place_medal: `Samsung Research`
  - Track 3: Robust Occupancy Prediction
    - :1st_place_medal: `ViewFormer`, :2nd_place_medal: `APEC Blue`, :3rd_place_medal: `hm.unilab`
  - Track 4: Robust Depth Estimation
    - :1st_place_medal: `HIT-AIIA`, :2nd_place_medal: `BUAA-Trans`, :3rd_place_medal: `CUSTZS`
  - Track 5: Robust Multi-Modal BEV Detection
    - :1st_place_medal: `safedrive-promax`, :2nd_place_medal: `Ponyville Autonauts Ltd`, :3rd_place_medal: `HITSZrobodrive`
- [2024.01] - The toolkit tailored for the 2024 RoboDrive Challenge has been released. :hammer_and_wrench:
- [2023.12] - We are hosting the RoboDrive Challenge at ICRA 2024. :blue_car:
- [2023.09] - Intend to improve the OoD robustness of your 3D perception models? Check out our recent work, Seal :seal:, an image-to-LiDAR self-supervised pretraining framework that leverages off-the-shelf knowledge from vision foundation models for cross-modality representation learning.
- [2023.07] - Robo3D was accepted to ICCV 2023! :tada:
- [2023.03] - We establish "Robust 3D Perception" leaderboards on Paper-with-Code: <sup>1</sup>`KITTI-C`, <sup>2</sup>`SemanticKITTI-C`, <sup>3</sup>`nuScenes-C`, and <sup>4</sup>`WOD-C`. Join the challenge today! :raising_hand:
- [2023.03] - The `KITTI-C`, `SemanticKITTI-C`, and `nuScenes-C` datasets are ready for download at the OpenDataLab platform. Kindly refer to this page for more details on preparing these datasets. :beers:
- [2023.01] - Launch of the `Robo3D` benchmark. In this initial version, we include 12 detectors and 22 segmentors, evaluated on 4 large-scale autonomous driving datasets (KITTI, SemanticKITTI, nuScenes, and Waymo Open) with 8 corruption types across 3 severity levels.
## Outline
- Taxonomy
- Video Demo
- Installation
- Data Preparation
- Getting Started
- Model Zoo
- Benchmark
- Create Corruption Set
- TODO List
- Citation
- License
- Acknowledgements
## Taxonomy
<img src="docs/figs/demo/bev_fog.gif" width="180"> | <img src="docs/figs/demo/bev_wet_ground.gif" width="180"> | <img src="docs/figs/demo/bev_snow.gif" width="180"> | <img src="docs/figs/demo/bev_motion_blur.gif" width="180"> |
<img src="docs/figs/demo/rv_fog.gif" width="180"> | <img src="docs/figs/demo/rv_wet_ground.gif" width="180"> | <img src="docs/figs/demo/rv_snow.gif" width="180"> | <img src="docs/figs/demo/rv_motion_blur.gif" width="180"> |
Fog | Wet Ground | Snow | Motion Blur |
<img src="docs/figs/demo/bev_beam_missing.gif" width="180"> | <img src="docs/figs/demo/bev_crosstalk.gif" width="180"> | <img src="docs/figs/demo/bev_incomplete_echo.gif" width="180"> | <img src="docs/figs/demo/bev_cross_sensor.gif" width="180"> |
<img src="docs/figs/demo/rv_beam_missing.gif" width="180"> | <img src="docs/figs/demo/rv_crosstalk.gif" width="180"> | <img src="docs/figs/demo/rv_incomplete_echo.gif" width="180"> | <img src="docs/figs/demo/rv_cross_sensor.gif" width="180"> |
Beam Missing | Crosstalk | Incomplete Echo | Cross-Sensor |
## Video Demo
Demo 1 | Demo 2 | Demo 3 |
---|---|---|
<img width="100%" src="docs/figs/demo1.png"> | <img width="100%" src="docs/figs/demo2.png"> | <img width="100%" src="docs/figs/demo3.png"> |
Link <sup>:arrow_heading_up:</sup> | Link <sup>:arrow_heading_up:</sup> | Link <sup>:arrow_heading_up:</sup> |
## Installation
For details related to installation, kindly refer to INSTALL.md.
## Data Preparation
Our datasets are hosted by OpenDataLab.
<img src="https://raw.githubusercontent.com/opendatalab/dsdl-sdk/2ae5264a7ce1ae6116720478f8fa9e59556bed41/resources/opendatalab.svg" width="32%"/><br> OpenDataLab is a pioneering open data platform for the large AI model era, making datasets accessible. By using OpenDataLab, researchers can obtain free formatted datasets in various fields.
Kindly refer to DATA_PREPARE.md for the details to prepare the <sup>1</sup>`KITTI`, <sup>2</sup>`KITTI-C`, <sup>3</sup>`SemanticKITTI`, <sup>4</sup>`SemanticKITTI-C`, <sup>5</sup>`nuScenes`, <sup>6</sup>`nuScenes-C`, <sup>7</sup>`WOD`, and <sup>8</sup>`WOD-C` datasets.
## Getting Started
To learn more about how to use this codebase, kindly refer to GET_STARTED.md.
## Model Zoo
<details open>
<summary><b>LiDAR Semantic Segmentation</b></summary>

- SqueezeSeg, ICRA 2018. <sup>[Code]</sup>
- SqueezeSegV2, ICRA 2019. <sup>[Code]</sup>
- MinkowskiNet, CVPR 2019. <sup>[Code]</sup>
- RangeNet++, IROS 2019. <sup>[Code]</sup>
- KPConv, ICCV 2019. <sup>[Code]</sup>
- SalsaNext, ISVC 2020. <sup>[Code]</sup>
- RandLA-Net, CVPR 2020. <sup>[Code]</sup>
- PolarNet, CVPR 2020. <sup>[Code]</sup>
- 3D-MiniNet, IROS 2020. <sup>[Code]</sup>
- SPVCNN, ECCV 2020. <sup>[Code]</sup>
- Cylinder3D, CVPR 2021. <sup>[Code]</sup>
- FIDNet, IROS 2021. <sup>[Code]</sup>
- RPVNet, ICCV 2021.
- CENet, ICME 2022. <sup>[Code]</sup>
- CPGNet, ICRA 2022. <sup>[Code]</sup>
- 2DPASS, ECCV 2022. <sup>[Code]</sup>
- GFNet, TMLR 2022. <sup>[Code]</sup>
- PCB-RandNet, arXiv 2022. <sup>[Code]</sup>
- PIDS, WACV 2023. <sup>[Code]</sup>
- SphereFormer, CVPR 2023. <sup>[Code]</sup>
- WaffleIron, ICCV 2023. <sup>[Code]</sup>
- FRNet, arXiv 2023. <sup>[Code]</sup>

</details>

<details open>
<summary><b>LiDAR Panoptic Segmentation</b></summary>

- DS-Net, CVPR 2021. <sup>[Code]</sup>
- Panoptic-PolarNet, CVPR 2021. <sup>[Code]</sup>

</details>

<details open>
<summary><b>3D Object Detection</b></summary>

- SECOND, Sensors 2018. <sup>[Code]</sup>
- PointPillars, CVPR 2019. <sup>[Code]</sup>
- PointRCNN, CVPR 2019. <sup>[Code]</sup>
- Part-A2, T-PAMI 2020.
- PV-RCNN, CVPR 2020. <sup>[Code]</sup>
- 3DSSD, CVPR 2020. <sup>[Code]</sup>
- SA-SSD, CVPR 2020. <sup>[Code]</sup>
- CenterPoint, CVPR 2021. <sup>[Code]</sup>
- PV-RCNN++, IJCV 2022. <sup>[Code]</sup>
- SphereFormer, CVPR 2023. <sup>[Code]</sup>

</details>
## Benchmark

### LiDAR Semantic Segmentation

The mean Intersection-over-Union (mIoU) is consistently used as the main indicator for evaluating model performance in our LiDAR semantic segmentation benchmark. The following two metrics are adopted to compare the robustness of different models (formalized right after this list):
- mCE (the lower the better): the average corruption error (in percentage) of a candidate model relative to the baseline model, averaged over all corruption types across three severity levels.
- mRR (the higher the better): the average resilience rate (in percentage) of a candidate model relative to its "clean" performance, averaged over all corruption types across three severity levels.
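
Concretely, for corruption type $i$ with severity levels $l \in \{1, 2, 3\}$, the two scores can be written as follows. This is a sketch of the standard corruption-robustness formulation, with $\mathrm{Acc}$ standing for the task metric (mIoU here):

$$
\mathrm{CE}_i = \frac{\sum_{l=1}^{3}\left(1-\mathrm{Acc}_{i,l}\right)}{\sum_{l=1}^{3}\left(1-\mathrm{Acc}_{i,l}^{\mathrm{baseline}}\right)}\,, \qquad
\mathrm{RR}_i = \frac{\sum_{l=1}^{3}\mathrm{Acc}_{i,l}}{3\times\mathrm{Acc}_{\mathrm{clean}}}\,,
$$

$$
\mathrm{mCE} = \frac{1}{N}\sum_{i=1}^{N}\mathrm{CE}_i\,, \qquad
\mathrm{mRR} = \frac{1}{N}\sum_{i=1}^{N}\mathrm{RR}_i\,,
$$

where $N$ is the number of corruption types.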
#### :red_car: SemanticKITTI-C

<p align="center"> <img src="docs/figs/stat/metrics_semkittic.png" align="center" width="100%"> </p>

Model | mCE (%) | mRR (%) | Clean | Fog | Wet Ground | Snow | Motion Blur | Beam Missing | Cross-Talk | Incomplete Echo | Cross-Sensor |
---|---|---|---|---|---|---|---|---|---|---|---|
SqueezeSeg | 164.87 | 66.81 | 31.61 | 18.85 | 27.30 | 22.70 | 17.93 | 25.01 | 21.65 | 27.66 | 7.85 |
SqueezeSegV2 | 152.45 | 65.29 | 41.28 | 25.64 | 35.02 | 27.75 | 22.75 | 32.19 | 26.68 | 33.80 | 11.78 |
RangeNet<sub>21</sub> | 136.33 | 73.42 | 47.15 | 31.04 | 40.88 | 37.43 | 31.16 | 38.16 | 37.98 | 41.54 | 18.76 |
RangeNet<sub>53</sub> | 130.66 | 73.59 | 50.29 | 36.33 | 43.07 | 40.02 | 30.10 | 40.80 | 46.08 | 42.67 | 16.98 |
SalsaNext | 116.14 | 80.51 | 55.80 | 34.89 | 48.44 | 45.55 | 47.93 | 49.63 | 40.21 | 48.03 | 44.72 |
FIDNet<sub>34</sub> | 113.81 | 76.99 | 58.80 | 43.66 | 51.63 | 49.68 | 40.38 | 49.32 | 49.46 | 48.17 | 29.85 |
CENet<sub>34</sub> | 103.41 | 81.29 | 62.55 | 42.70 | 57.34 | 53.64 | 52.71 | 55.78 | 45.37 | 53.40 | 45.84 |
FRNet | 96.80 | 80.04 | 67.55 | 47.61 | 62.15 | 57.08 | 56.80 | 62.54 | 40.94 | 58.11 | 47.30 |
KPConv | 99.54 | 82.90 | 62.17 | 54.46 | 57.70 | 54.15 | 25.70 | 57.35 | 53.38 | 55.64 | 53.91 |
PIDS<sub>NAS1.25x</sub> | 104.13 | 77.94 | 63.25 | 47.90 | 54.48 | 48.86 | 22.97 | 54.93 | 56.70 | 55.81 | 52.72 |
PIDS<sub>NAS2.0x</sub> | 101.20 | 78.42 | 64.55 | 51.19 | 55.97 | 51.11 | 22.49 | 56.95 | 57.41 | 55.55 | 54.27 |
WaffleIron | 109.54 | 72.18 | 66.04 | 45.52 | 58.55 | 49.30 | 33.02 | 59.28 | 22.48 | 58.55 | 54.62 |
PolarNet | 118.56 | 74.98 | 58.17 | 38.74 | 50.73 | 49.42 | 41.77 | 54.10 | 25.79 | 48.96 | 39.44 |
<sup>:star:</sup>MinkUNet<sub>18</sub> | 100.00 | 81.90 | 62.76 | 55.87 | 53.99 | 53.28 | 32.92 | 56.32 | 58.34 | 54.43 | 46.05 |
MinkUNet<sub>34</sub> | 100.61 | 80.22 | 63.78 | 53.54 | 54.27 | 50.17 | 33.80 | 57.35 | 58.38 | 54.88 | 46.95 |
Cylinder3D<sub>SPC</sub> | 103.25 | 80.08 | 63.42 | 37.10 | 57.45 | 46.94 | 52.45 | 57.64 | 55.98 | 52.51 | 46.22 |
Cylinder3D<sub>TSC</sub> | 103.13 | 83.90 | 61.00 | 37.11 | 53.40 | 45.39 | 58.64 | 56.81 | 53.59 | 54.88 | 49.62 |
SPVCNN<sub>18</sub> | 100.30 | 82.15 | 62.47 | 55.32 | 53.98 | 51.42 | 34.53 | 56.67 | 58.10 | 54.60 | 45.95 |
SPVCNN<sub>34</sub> | 99.16 | 82.01 | 63.22 | 56.53 | 53.68 | 52.35 | 34.39 | 56.76 | 59.00 | 54.97 | 47.07 |
RPVNet | 111.74 | 73.86 | 63.75 | 47.64 | 53.54 | 51.13 | 47.29 | 53.51 | 22.64 | 54.79 | 46.17 |
CPGNet | 107.34 | 81.05 | 61.50 | 37.79 | 57.39 | 51.26 | 59.05 | 60.29 | 18.50 | 56.72 | 57.79 |
2DPASS | 106.14 | 77.50 | 64.61 | 40.46 | 60.68 | 48.53 | 57.80 | 58.78 | 28.46 | 55.84 | 50.01 |
GFNet | 108.68 | 77.92 | 63.00 | 42.04 | 56.57 | 56.71 | 58.59 | 56.95 | 17.14 | 55.23 | 49.48 |
Note: Symbol <sup>:star:</sup> denotes the baseline model adopted in mCE calculation.
#### :blue_car: nuScenes-C

<p align="center"> <img src="docs/figs/stat/metrics_nusc_seg.png" align="center" width="100%"> </p>

Model | mCE (%) | mRR (%) | Clean | Fog | Wet Ground | Snow | Motion Blur | Beam Missing | Cross-Talk | Incomplete Echo | Cross-Sensor |
---|---|---|---|---|---|---|---|---|---|---|---|
FIDNet<sub>34</sub> | 122.42 | 73.33 | 71.38 | 64.80 | 68.02 | 58.97 | 48.90 | 48.14 | 57.45 | 48.76 | 23.70 |
CENet<sub>34</sub> | 112.79 | 76.04 | 73.28 | 67.01 | 69.87 | 61.64 | 58.31 | 49.97 | 60.89 | 53.31 | 24.78 |
FRNet | 98.63 | 77.48 | 77.65 | 69.14 | 76.58 | 69.49 | 54.49 | 68.32 | 41.43 | 58.74 | 43.13 |
WaffleIron | 106.73 | 72.78 | 76.07 | 56.07 | 73.93 | 49.59 | 59.46 | 65.19 | 33.12 | 61.51 | 44.01 |
PolarNet | 115.09 | 76.34 | 71.37 | 58.23 | 69.91 | 64.82 | 44.60 | 61.91 | 40.77 | 53.64 | 42.01 |
<sup>:star:</sup>MinkUNet<sub>18</sub> | 100.00 | 74.44 | 75.76 | 53.64 | 73.91 | 40.35 | 73.39 | 68.54 | 26.58 | 63.83 | 50.95 |
MinkUNet<sub>34</sub> | 96.37 | 75.08 | 76.90 | 56.91 | 74.93 | 37.50 | 75.24 | 70.10 | 29.32 | 64.96 | 52.96 |
Cylinder3D<sub>SPC</sub> | 111.84 | 72.94 | 76.15 | 59.85 | 72.69 | 58.07 | 42.13 | 64.45 | 44.44 | 60.50 | 42.23 |
Cylinder3D<sub>TSC</sub> | 105.56 | 78.08 | 73.54 | 61.42 | 71.02 | 58.40 | 56.02 | 64.15 | 45.36 | 59.97 | 43.03 |
SPVCNN<sub>18</sub> | 106.65 | 74.70 | 74.40 | 59.01 | 72.46 | 41.08 | 58.36 | 65.36 | 36.83 | 62.29 | 49.21 |
SPVCNN<sub>34</sub> | 97.45 | 75.10 | 76.57 | 55.86 | 74.04 | 41.95 | 74.63 | 68.94 | 28.11 | 64.96 | 51.57 |
2DPASS | 98.56 | 75.24 | 77.92 | 64.50 | 76.76 | 54.46 | 62.04 | 67.84 | 34.37 | 63.19 | 45.83 |
GFNet | 92.55 | 83.31 | 76.79 | 69.59 | 75.52 | 71.83 | 59.43 | 64.47 | 66.78 | 61.86 | 42.30 |
Note: Symbol <sup>:star:</sup> denotes the baseline model adopted in mCE calculation.
#### :taxi: WOD-C

<p align="center"> <img src="docs/figs/stat/metrics_wod_seg.png" align="center" width="100%"> </p>

Model | mCE (%) | mRR (%) | Clean | Fog | Wet Ground | Snow | Motion Blur | Beam Missing | Cross-Talk | Incomplete Echo | Cross-Sensor |
---|---|---|---|---|---|---|---|---|---|---|---|
<sup>:star:</sup>MinkUNet<sub>18</sub> | 100.00 | 91.22 | 69.06 | 66.99 | 60.99 | 57.75 | 68.92 | 64.15 | 65.37 | 63.36 | 56.44 |
MinkUNet<sub>34</sub> | 96.21 | 91.80 | 70.15 | 68.31 | 62.98 | 57.95 | 70.10 | 65.79 | 66.48 | 64.55 | 59.02 |
Cylinder3D<sub>TSC</sub> | 106.02 | 92.39 | 65.93 | 63.09 | 59.40 | 58.43 | 65.72 | 62.08 | 62.99 | 60.34 | 55.27 |
SPVCNN<sub>18</sub> | 103.60 | 91.60 | 67.35 | 65.13 | 59.12 | 58.10 | 67.24 | 62.41 | 65.46 | 61.79 | 54.30 |
SPVCNN<sub>34</sub> | 98.72 | 92.04 | 69.01 | 67.10 | 62.41 | 57.57 | 68.92 | 64.67 | 64.70 | 64.14 | 58.63 |
Note: Symbol <sup>:star:</sup> denotes the baseline model adopted in mCE calculation.
### 3D Object Detection

The mean average precision (mAP) and nuScenes detection score (NDS) are consistently used as the main indicators for evaluating model performance in our 3D object detection benchmark. The following two metrics are adopted to compare the robustness of different models (a minimal computation sketch follows the list):
- mCE (the lower the better): the average corruption error (in percentage) of a candidate model relative to the baseline model, averaged over all corruption types across three severity levels.
- mRR (the higher the better): the average resilience rate (in percentage) of a candidate model relative to its "clean" performance, averaged over all corruption types across three severity levels.
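
As a concrete illustration of how these scores follow from the definitions above, here is a minimal NumPy sketch; the function name, arguments, and toy numbers are hypothetical and are not part of the released evaluation scripts referenced in GET_STARTED.md.

```python
import numpy as np

def compute_mce_mrr(candidate, baseline, candidate_clean):
    """Toy computation of mCE / mRR (both in %) from per-corruption scores.

    `candidate` and `baseline` are (num_corruptions, num_severities) arrays of
    task scores in percent (e.g., mAP or mIoU); `candidate_clean` is the
    candidate model's score on the uncorrupted set.
    """
    candidate = np.asarray(candidate, dtype=float)
    baseline = np.asarray(baseline, dtype=float)
    # Corruption Error: candidate's error relative to the baseline's error,
    # summed over severity levels for each corruption type.
    ce = (100.0 - candidate).sum(axis=1) / (100.0 - baseline).sum(axis=1)
    # Resilience Rate: performance retained relative to the "clean" score.
    rr = candidate.sum(axis=1) / (candidate.shape[1] * candidate_clean)
    return 100.0 * ce.mean(), 100.0 * rr.mean()

# Two corruption types x three severity levels (illustrative numbers only).
cand = [[60.0, 55.0, 50.0], [58.0, 52.0, 45.0]]
base = [[55.0, 50.0, 45.0], [54.0, 48.0, 40.0]]
mce, mrr = compute_mce_mrr(cand, base, candidate_clean=65.0)
print(f"mCE: {mce:.2f}%  mRR: {mrr:.2f}%")
```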
#### :red_car: KITTI-C

<p align="center"> <img src="docs/figs/stat/metrics_kittic.png" align="center" width="100%"> </p>

Model | mCE (%) | mRR (%) | Clean | Fog | Wet Ground | Snow | Motion Blur | Beam Missing | Cross-Talk | Incomplete Echo | Cross-Sensor |
---|---|---|---|---|---|---|---|---|---|---|---|
PointPillars | 110.67 | 74.94 | 66.70 | 45.70 | 66.71 | 35.77 | 47.09 | 52.24 | 60.01 | 54.84 | 37.50 |
SECOND | 95.93 | 82.94 | 68.49 | 53.24 | 68.51 | 54.92 | 49.19 | 54.14 | 67.19 | 59.25 | 48.00 |
PointRCNN | 91.88 | 83.46 | 70.26 | 56.31 | 71.82 | 50.20 | 51.52 | 56.84 | 65.70 | 62.02 | 54.73 |
PartA2<sub>Free</sub> | 82.22 | 81.87 | 76.28 | 58.06 | 76.29 | 58.17 | 55.15 | 59.46 | 75.59 | 65.66 | 51.22 |
PartA2<sub>Anchor</sub> | 88.62 | 80.67 | 73.98 | 56.59 | 73.97 | 51.32 | 55.04 | 56.38 | 71.72 | 63.29 | 49.15 |
PVRCNN | 90.04 | 81.73 | 72.36 | 55.36 | 72.89 | 52.12 | 54.44 | 56.88 | 70.39 | 63.00 | 48.01 |
<sup>:star:</sup>CenterPoint | 100.00 | 79.73 | 68.70 | 53.10 | 68.71 | 48.56 | 47.94 | 49.88 | 66.00 | 58.90 | 45.12 |
SphereFormer | - | - | - | - | - | - | - | - | - | - | - |
Note: Symbol <sup>:star:</sup> denotes the baseline model adopted in mCE calculation.
#### :blue_car: nuScenes-C

<p align="center"> <img src="docs/figs/stat/metrics_nusc_det.png" align="center" width="100%"> </p>

Model | mCE (%) | mRR (%) | Clean | Fog | Wet Ground | Snow | Motion Blur | Beam Missing | Cross-Talk | Incomplete Echo | Cross-Sensor |
---|---|---|---|---|---|---|---|---|---|---|---|
PointPillars<sub>MH</sub> | 102.90 | 77.24 | 43.33 | 33.16 | 42.92 | 29.49 | 38.04 | 33.61 | 34.61 | 30.90 | 25.00 |
SECOND<sub>MH</sub> | 97.50 | 76.96 | 47.87 | 38.00 | 47.59 | 33.92 | 41.32 | 35.64 | 40.30 | 34.12 | 23.82 |
<sup>:star:</sup>CenterPoint | 100.00 | 76.68 | 45.99 | 35.01 | 45.41 | 31.23 | 41.79 | 35.16 | 35.22 | 32.53 | 25.78 |
CenterPoint<sub>LR</sub> | 98.74 | 72.49 | 49.72 | 36.39 | 47.34 | 32.81 | 40.54 | 34.47 | 38.11 | 35.50 | 23.16 |
CenterPoint<sub>HR</sub> | 95.80 | 75.26 | 50.31 | 39.55 | 49.77 | 34.73 | 43.21 | 36.21 | 40.98 | 35.09 | 23.38 |
SphereFormer | - | - | - | - | - | - | - | - | - | - | - |
Note: Symbol <sup>:star:</sup> denotes the baseline model adopted in mCE calculation.
#### :taxi: WOD-C

<p align="center"> <img src="docs/figs/stat/metrics_wod_det.png" align="center" width="100%"> </p>

Model | mCE (%) | mRR (%) | Clean | Fog | Wet Ground | Snow | Motion Blur | Beam Missing | Cross-Talk | Incomplete Echo | Cross-Sensor |
---|---|---|---|---|---|---|---|---|---|---|---|
PointPillars | 127.53 | 81.23 | 50.17 | 31.24 | 49.75 | 46.07 | 34.93 | 43.93 | 39.80 | 43.41 | 36.67 |
SECOND | 121.43 | 81.12 | 53.37 | 32.89 | 52.99 | 47.20 | 35.98 | 44.72 | 49.28 | 46.84 | 36.43 |
PVRCNN | 104.90 | 82.43 | 61.27 | 37.32 | 61.27 | 60.38 | 42.78 | 49.53 | 59.59 | 54.43 | 38.73 |
<sup>:star:</sup>CenterPoint | 100.00 | 83.30 | 63.59 | 43.06 | 62.84 | 58.59 | 43.53 | 54.41 | 60.32 | 57.01 | 43.98 |
PVRCNN++ | 91.60 | 84.14 | 67.45 | 45.50 | 67.18 | 62.71 | 47.35 | 57.83 | 64.71 | 60.96 | 47.77 |
SphereFormer | - | - | - | - | - | - | - | - | - | - | - |
Note: Symbol <sup>:star:</sup> denotes the baseline model adopted in mCE calculation.
### :vertical_traffic_light: More Benchmarking Results
For more detailed experimental results and visual comparisons, please refer to RESULTS.md.
## Create Corruption Set

You can create your own "Robo3D" corruption sets on other LiDAR-based point cloud datasets using our defined corruption types! Follow the instructions in CREATE.md; a simplified illustration of one corruption is sketched below.
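
To give a flavor of what such a corruption looks like, here is a simplified, hypothetical sketch of the `beam missing` corruption (randomly dropping LiDAR beams). It approximates beam indices from elevation angles and is not the implementation used in CREATE.md, which follows the exact protocol described in the paper.

```python
import numpy as np

def simulate_beam_missing(points, num_beams=64, keep_ratio=0.5, seed=0):
    """Illustrative 'beam missing' corruption: randomly drop a subset of beams.

    `points` is an (N, 4) array of (x, y, z, intensity). Beam indices are
    approximated by quantizing the elevation angle, which is a simplification
    compared to using the sensor's true ring index.
    """
    rng = np.random.default_rng(seed)
    xy_dist = np.linalg.norm(points[:, :2], axis=1)
    elevation = np.arctan2(points[:, 2], xy_dist)
    # Quantize elevation angles into `num_beams` pseudo-rings.
    bins = np.linspace(elevation.min(), elevation.max() + 1e-6, num_beams + 1)
    beam_idx = np.clip(np.digitize(elevation, bins) - 1, 0, num_beams - 1)
    # Keep only a random subset of beams; all points on dropped beams vanish.
    kept = rng.choice(num_beams, size=int(num_beams * keep_ratio), replace=False)
    return points[np.isin(beam_idx, kept)]

# Toy usage with a random point cloud (illustrative only).
cloud = np.random.randn(10000, 4).astype(np.float32)
corrupted = simulate_beam_missing(cloud, num_beams=64, keep_ratio=0.5)
print(cloud.shape, "->", corrupted.shape)
```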
## TODO List
- Initial release. 🚀
- Add scripts for creating common corruptions.
- Add download links for corruption sets.
- Add evaluation scripts on corruption sets.
- Release checkpoints.
- ...
## Citation
If you find this work helpful, please kindly consider citing our paper:
```bibtex
@inproceedings{kong2023robo3d,
    author = {Lingdong Kong and Youquan Liu and Xin Li and Runnan Chen and Wenwei Zhang and Jiawei Ren and Liang Pan and Kai Chen and Ziwei Liu},
    title = {Robo3D: Towards Robust and Reliable 3D Perception against Corruptions},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    pages = {19994--20006},
    year = {2023},
}
```
```bibtex
@misc{kong2023robo3d_benchmark,
    title = {The Robo3D Benchmark for Robust and Reliable 3D Perception},
    author = {Lingdong Kong and Youquan Liu and Xin Li and Runnan Chen and Wenwei Zhang and Jiawei Ren and Liang Pan and Kai Chen and Ziwei Liu},
    howpublished = {\url{https://github.com/ldkong1205/Robo3D}},
    year = {2023},
}
```
## License

<a rel="license" href="http://creativecommons.org/licenses/by-nc-sa/4.0/"><img alt="Creative Commons License" style="border-width:0" src="https://i.creativecommons.org/l/by-nc-sa/4.0/80x15.png" /></a>
<br />
This work is under the <a rel="license" href="http://creativecommons.org/licenses/by-nc-sa/4.0/">Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License</a>, while some parts of this codebase may be under other licenses. Kindly refer to LICENSE.md for a more careful check if you are using our code for commercial purposes.
## Acknowledgements
This work is developed based on the MMDetection3D codebase.
<img src="https://github.com/open-mmlab/mmdetection3d/blob/main/resources/mmdet3d-logo.png" width="30%"/><br> MMDetection3D is an open source object detection toolbox based on PyTorch, towards the next-generation platform for general 3D detection. It is a part of the OpenMMLab project developed by MMLab.
:heart: We thank Jiangmiao Pang and Tai Wang for their insightful discussions and feedback. We thank the OpenDataLab platform for hosting our datasets.