Home

Awesome

TCFormer (CVPR'2022 Oral, TPAMI'2024)

[CVPR'2022 paper] [TPAMI'2024 paper]

Introduction

Official code repository for the papers:
Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer
[Wang Zeng, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang, and Xiaogang Wang]

and

TCFormer: Visual Recognition via Token Clustering Transformer
[Wang Zeng, Sheng Jin, Lumin Xu, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, and Xiaogang Wang]

teaser

TODO

Model Zoo

You can find the pretrained checkpoints here.

Image Classification

Classification configs & weights see >>>here<<<.

MethodSizeAcc@1#Params (M)ConfigCheckpointlog
TCFormer-light22479.414.2Mconfig57M [Google][Google]
TCFormer22482.325.6Mconfig103M [Google][Google]
TCFormer-large22483.662.8Mconfig103M [Google][Google]

WholeBody Estimation

WholeBody Estimation configs & weights see >>>here<<<.

ArchInput SizeBody APBody ARFoot APFoot ARFace APFace ARHand APHand ARWhole APWhole ARckptlog
TCFormer256x1920.6970.7740.7050.8210.6560.7530.5390.6520.5760.681ckptlog
TCFormer_large384x2880.7180.7940.7440.8500.7900.8560.6140.7150.6420.733ckptlog

Citation

If you find this project useful in your research, please cite:

@inproceedings{zeng2022not,
  title={Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer},
  author={Zeng, Wang and Jin, Sheng and Liu, Wentao and Qian, Chen and Luo, Ping and Ouyang, Wanli and Wang, Xiaogang},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={11101--11111},
  year={2022}
}

@article{zeng2024tcformer,
  title={TCFormer: Visual Recognition via Token Clustering Transformer},
  author={Zeng, Wang and Jin, Sheng and Xu, Lumin and Liu, Wentao and Qian, Chen and Ouyang, Wanli and Luo, Ping and Wang, Xiaogang},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year={2024},
  publisher={IEEE}
}

Acknowledgement

Thanks to:

License

This project is released under the Apache 2.0 license.