Home

Awesome

Spatio-temporal Relation Modeling for Few-shot Action Recognition (CVPR 2022)

PWC PWC PWC PWC

[Paper][Project Page]

Anirudh Thatipelli, Sanath Narayan, Salman Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan, Bernard Ghanem

Installation

The codebase is built on PyTorch 1.9.0 and tested on Ubuntu 18.04 environment (Python3.8.8, CUDA11.0) and trained on 4 GPUs. Build a conda environment using the requirements given in environment.yaml.

Attention Visualization

<img src = "https://imgur.com/CZym9q1.png" width="900">

Results

MethodKineticsSSv2HMDBUCF
CMN-J78.9---
TARN78.5---
ARN82.4-60.683.1
OTAM85.852.3--
HF-AR-55.162.286.4
TRX85.964.675.696.1
STRM [Ours]86.768.177.396.8
<!-- <table> <tr> <img src = "images/SOTA_Table.png" width="900"> </tr> </table> -->

Training and Evaluation

Step 1 : Data preparation

Prepare the datasets according to the splits provided.

Step 2 : Training

Use the scripts given in scripts

Step 3 : Evaluation

  1. Use the evaluation script as given in eval_strm_ssv2.sh

  2. Download the checkpoints from these links: SSV2, Kinetics, HMDB, UCF

Citation

If you find this repository useful, please consider giving a star :star: and citation :confetti_ball::

@inproceedings{thatipelli2021spatio,
  title={Spatio-temporal Relation Modeling for Few-shot Action Recognition},
  author={Thatipelli, Anirudh and Narayan, Sanath and Khan, Salman and Anwer, Rao Muhammad and Khan, Fahad Shahbaz and Ghanem, Bernard},
  booktitle={CVPR},
  year={2022}
}

Acknowledgements

The codebase was built on top of trx. Many thanks to Toby Perrett for previous work.

Contact

Should you have any question, please contact :e-mail: thatipellianirudh@gmail.com or message me on Linkedin.