Home

Awesome

<div align="center"> <h1>RecurrentBEV</h1> <h3>[ECCV 2024] RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection</h3> </div> <div align="center"> <img src="resources/paper_paradigm.jpg" width="800"/> </div><br/>

Introduction

This repository is an official implementation of RecurrentBEV. It is built based on MMDetection3D.

Main Results

<img title="" src="resources/res50_bench.jpg" alt="" data-align="left" width="600">

NuScenes Val Set

BackboneImg SizePretrainNDSmAPConfigDownload
Res50256x704ImageNet54.944.5configmodel
Res101512x1408ImageNet59.950.9config-
Res101512x1408NuImages61.252.8configmodel

NuScenes Test Set

BackboneImg SizePretrainNDSmAPConfigDownload
V2-99640x1600DD3D65.157.3configmodel
ConvNeXt-B640x1600COCO65.157.4config-

Inference Speed

The below table shows end-to-end FPS (Frames Per Second) of RecurrentBEV measured with a single RTX-3090.

MethodPytorch-FP32TensorRT-FP32TensorRT-FP16TensorRT-INT8
RecurrentBEV25.646.3129.3234.8
StreamPETR26.753.9134.6167.4

Getting Started

Please follow our documentation step by step. If you like our work, please recommend it to your colleagues and friends.

  1. Environment Setup.

  2. Data Preparation.

  3. Training and Inference.

  4. Visualization.

  5. Deployment.

Features List

Acknowledgements

We thank these great works and open-source codebases:

Citation

If you find RecurrentBEV is useful in your research or applications, please consider giving us a star 🌟 and citing it by the following BibTeX entry.