English | 简体中文

<h2 align="center">RT-DETR: DETRs Beat YOLOs on Real-time Object Detection</h2> <p align="center"> <!-- <a href="https://github.com/lyuwenyu/RT-DETR/blob/main/LICENSE"> <img alt="license" src="https://img.shields.io/badge/LICENSE-Apache%202.0-blue"> </a> --> <a href="https://github.com/lyuwenyu/RT-DETR/blob/main/LICENSE"> <img alt="license" src="https://img.shields.io/github/license/lyuwenyu/RT-DETR"> </a> <a href="https://github.com/lyuwenyu/RT-DETR/pulls"> <img alt="prs" src="https://img.shields.io/github/issues-pr/lyuwenyu/RT-DETR"> </a> <a href="https://github.com/lyuwenyu/RT-DETR/issues"> <img alt="issues" src="https://img.shields.io/github/issues/lyuwenyu/RT-DETR?color=pink"> </a> <a href="https://github.com/lyuwenyu/RT-DETR"> <img alt="stars" src="https://img.shields.io/github/stars/lyuwenyu/RT-DETR"> </a> <a href="https://arxiv.org/abs/2304.08069"> <img alt="arXiv" src="https://img.shields.io/badge/arXiv-2304.08069-red"> </a> <a href="mailto:lyuwenyu@foxmail.com"> <img alt="email" src="https://img.shields.io/badge/contact_me-email-yellow"> </a> </p>

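This is the official implementation of the papers "DETRs Beat YOLOs on Real-time Object Detection" and "RT-DETRv2: Improved Baseline with Bag-of-Freebies for Real-Time Detection Transformer".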

<details> <summary>Fig</summary> <table><tr> <td><img src=https://github.com/lyuwenyu/RT-DETR/assets/77494834/0ede1dc1-a854-43b6-9986-cf9090f11a61 border=0 width=500></td> <td><img src=https://github.com/user-attachments/assets/437877e9-1d4f-4d30-85e8-aafacfa0ec56 border=0 width=500></td> </tr></table> </details>

🚀 Updates

๐Ÿ“ Implementations

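| Model | Input shape | Dataset | $AP^{val}$ | $AP^{val}_{50}$ | Params (M) | FLOPs (G) | T4 TensorRT FP16 (FPS) |
|:---|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
| RT-DETR-R18 | 640 | COCO | 46.5 | 63.8 | 20 | 60 | 217 |
| RT-DETR-R34 | 640 | COCO | 48.9 | 66.8 | 31 | 92 | 161 |
| RT-DETR-R50-m | 640 | COCO | 51.3 | 69.6 | 36 | 100 | 145 |
| RT-DETR-R50 | 640 | COCO | 53.1 | 71.3 | 42 | 136 | 108 |
| RT-DETR-R101 | 640 | COCO | 54.3 | 72.7 | 76 | 259 | 74 |
| RT-DETR-HGNetv2-L | 640 | COCO | 53.0 | 71.6 | 32 | 110 | 114 |
| RT-DETR-HGNetv2-X | 640 | COCO | 54.8 | 73.1 | 67 | 234 | 74 |
| RT-DETR-R18 | 640 | COCO + Objects365 | 49.2 | 66.6 | 20 | 60 | 217 |
| RT-DETR-R50 | 640 | COCO + Objects365 | 55.3 | 73.4 | 42 | 136 | 108 |
| RT-DETR-R101 | 640 | COCO + Objects365 | 56.2 | 74.6 | 76 | 259 | 74 |
| RT-DETRv2-S | 640 | COCO | 48.1 <font color=green>(+1.6)</font> | 65.1 | 20 | 60 | 217 |
| RT-DETRv2-M<sup>*</sup> | 640 | COCO | 49.9 <font color=green>(+1.0)</font> | 67.5 | 31 | 92 | 161 |
| RT-DETRv2-M | 640 | COCO | 51.9 <font color=green>(+0.6)</font> | 69.9 | 36 | 100 | 145 |
| RT-DETRv2-L | 640 | COCO | 53.4 <font color=green>(+0.3)</font> | 71.6 | 42 | 136 | 108 |
| RT-DETRv2-X | 640 | COCO | 54.3 | 72.8 <font color=green>(+0.1)</font> | 76 | 259 | 74 |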
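
The green deltas in the RT-DETRv2 rows appear to compare each v2 model against the RT-DETR model with the same Params, FLOPs, and FPS. That pairing is an assumption inferred from the matching columns, not something the table states. Under that assumption, a minimal Python sketch that recomputes the deltas from the table values:

```python
# Values copied from the table above.
# Each entry: (RT-DETRv2 model, assumed RT-DETR baseline, metric, v2 value, baseline value).
# The baseline pairing is an assumption based on identical Params/FLOPs/FPS.
PAIRS = [
    ("RT-DETRv2-S", "RT-DETR-R18", "AP", 48.1, 46.5),
    ("RT-DETRv2-M*", "RT-DETR-R34", "AP", 49.9, 48.9),
    ("RT-DETRv2-M", "RT-DETR-R50-m", "AP", 51.9, 51.3),
    ("RT-DETRv2-L", "RT-DETR-R50", "AP", 53.4, 53.1),
    ("RT-DETRv2-X", "RT-DETR-R101", "AP50", 72.8, 72.7),
]


def gain(v2_value: float, baseline_value: float) -> float:
    """Return the improvement of v2 over the baseline, rounded to one decimal."""
    return round(v2_value - baseline_value, 1)


if __name__ == "__main__":
    for v2_name, base_name, metric, v2_value, base_value in PAIRS:
        print(f"{v2_name} vs {base_name}: {gain(v2_value, base_value):+.1f} {metric}")
```

Rounding to one decimal matches the precision reported in the table and avoids floating-point noise in the subtraction.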

Notes:

🦄 Performance

๐Ÿ•๏ธ Complex Scenarios

<div align="center"> <img src="https://github.com/lyuwenyu/RT-DETR/assets/77494834/52743892-68c8-4e53-b782-9f89221739e4" width=500 > </div>

🌋 Difficult Conditions

<div align="center"> <img src="https://github.com/lyuwenyu/RT-DETR/assets/77494834/213cf795-6da6-4261-8549-11947292d3cb" width=500 > </div>

Citation

If you use RT-DETR or RT-DETRv2 in your work, please cite it using the following BibTeX entries:

@misc{lv2023detrs,
      title={DETRs Beat YOLOs on Real-time Object Detection},
      author={Yian Zhao and Wenyu Lv and Shangliang Xu and Jinman Wei and Guanzhong Wang and Qingqing Dang and Yi Liu and Jie Chen},
      year={2023},
      eprint={2304.08069},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

@misc{lv2024rtdetrv2improvedbaselinebagoffreebies,
      title={RT-DETRv2: Improved Baseline with Bag-of-Freebies for Real-Time Detection Transformer}, 
      author={Wenyu Lv and Yian Zhao and Qinyao Chang and Kui Huang and Guanzhong Wang and Yi Liu},
      year={2024},
      eprint={2407.17140},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2407.17140}, 
}