This repo contains the official PyTorch code for Slide-Transformer.

Introduction

<p align="center"> <img src="figs/intro.png" width= "400"> </p>

Method

<p align="center"> <img src="figs/method.png" width= "1000"> </p> <p align="center"> <img src="figs/method_shift.png" width= "400"> </p>

Results

Classification

<p align="center"> <img src="figs/exp_cls.png" width= "1000"> </p> <p align="center"> <img src="figs/exp_det.png" width= "1000"> </p> <p align="center"> <img src="figs/exp_seg.png" width= "400"> </p> <p align="center"> <img src="figs/exp_speed.png" width= "1000"> </p>

Acknowledgements

Our code is built on top of PVT, Swin Transformer, and CSwin Transformer.

Citation

If you find this repo helpful, please consider citing us.

@inproceedings{pan2023slide,
  title={Slide-transformer: Hierarchical vision transformer with local self-attention},
  author={Pan, Xuran and Ye, Tianzhu and Xia, Zhuofan and Song, Shiji and Huang, Gao},
  booktitle={Proceedings of the IEEE/CVF conference on computer vision and pattern recognition},
  pages={2082--2091},
  year={2023}
}