Home

Awesome

Grouped Spatial-Temporal Aggretation for Efficient Action Recognition

Pytorch implementation of paper Grouped Spatial-Temporal Aggretation for Efficient Action Recognition. arxiv

Prerequisites

Data preparation

Please refer to TRN-pytorch for data preparation on Something-Something.

Training

Pretrained Models

Something-v1Something-v2
GST(alpha=4, 8 frames)47.061.6
GST(alpha=4,16 frames)48.662.6
GST-Large(alpha=4,8 frames)47.762.0

Reference

If you find our work useful in your research, please consider citing our paper

@inproceedings{luo2019grouped,
  title={Grouped Spatial-Temporal Aggretation for Efficient Action Recognition},
  author={Luo, Chenxu and Yuille, Alan},
  booktitle={Proceedings of the IEEE International Conference on Computer Vision},
  year={2019}
} 

or

@article{luo2019grouped,
  title={Grouped Spatial-Temporal Aggregation for Efficient Action Recognition},
  author={Luo, Chenxu and Yuille, Alan},
  journal={arXiv preprint arXiv:1909.13130},
  year={2019}
}

Acknowledge

This codebase is build upon TRN-pytorch and TSN-pytorch