Home

Awesome

Video Test-Time Adaptation for Action Recognition (CVPR 2023)

ProjectPage

ViTTA is the first approach of test-time adaptation of video action recognition models against common distribution shifts. ViTTA is tailored to saptio-temporal models and capable of adaptation on a single video sample at a step. It consists in a feature distribution alignment technique that aligns online estimates of test set statistics towards the training statistics. It further enforces prediction consistency over temporally augmented views of the same test video sample.

Official implementation of ViTTA [arXiv]
Author HomePage
🤗 Dataset (12 corruption types of Kinetics 400 and Something-Something v2, and UCF101 data)

Requirements


Data Preparation


Usage

Specify the data paths in the scripts correspondingly (see comments in scripts)


Citation

Thanks for citing our paper:

@inproceedings{lin2023video,
  title={Video Test-Time Adaptation for Action Recognition},
  author={Lin, Wei and Mirza, Muhammad Jehanzeb and Kozinski, Mateusz and Possegger, Horst and Kuehne, Hilde and Bischof, Horst},
  booktitle={CVPR},
  year={2023},
}