Home

Awesome

OC-SORT

arXiv License: MIT test

Observation-Centric SORT (OC-SORT) is a pure motion-model-based multi-object tracker. It aims to improve tracking robustness in crowded scenes and when objects are in non-linear motion. It is designed by recognizing and fixing limitations in Kalman filter and SORT. It is flexible to integrate with different detectors and matching modules, such as appearance similarity. It remains, Simple, Online and Real-time.

Pipeline

<center> <img src="assets/teaser.png" width="600"/> </center>

Observation-centric Re-Update

<center> <img src="assets/ocr.png" width="600"/> </center>

News

Benchmark Performance

PWC PWC PWC PWC PWC

DatasetHOTAAssAIDF1MOTAFPFNIDsFrag
MOT17 (private)63.263.277.578.015,129107,0551,9502,040
MOT17 (public)52.457.665.158.24,379230,4497842,006
MOT20 (private)62.462.576.475.920,218103,7919381,004
MOT20 (public)54.359.567.059.94,434202,5025542,345
KITTI-cars76.576.4-90.32,685407250280
KITTI-pedestrian54.759.1-65.16,4221,443204609
DanceTrack-test55.138.054.289.4114,107139,0831,9923,838
CroHD HeadTrack44.1-62.967.9102,050164,0904,24310,122

Get Started

Demo

To run the tracker on a provided demo video from Youtube:

python3 tools/demo_track.py --demo_type video -f exps/example/mot/yolox_dancetrack_test.py -c pretrained/ocsort_dance_model.pth.tar --path videos/dance_demo.mp4 --fp16 --fuse --save_result --out_path demo_out.mp4
<center> <img src="assets/dance_demo.gif" width="600"/> </center>

Roadmap

We are still actively updating OC-SORT. We always welcome contributions to make it better for the community. We have some high-priorty to-dos as below:

Acknowledgement and Citation

The codebase is built highly upon YOLOX, filterpy, and ByteTrack. We thank their wondeful works. OC-SORT, filterpy and ByteTrack are available under MIT License. And YOLOX uses Apache License 2.0 License.

If you find this work useful, please consider to cite our paper:

@inproceedings{cao2023observation,
  title={Observation-centric sort: Rethinking sort for robust multi-object tracking},
  author={Cao, Jinkun and Pang, Jiangmiao and Weng, Xinshuo and Khirodkar, Rawal and Kitani, Kris},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={9686--9696},
  year={2023}
}