Home

Awesome

HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization (ECCV 2024)

Sakib Reza, Yuexi Zhang, Mohsen Moghaddam, Octavia Camps

Northeastern University, Boston, United States

{reza.s,zhang.yuex,mohsen,o.camps}@northeastern.edu

Arxiv Preprint

Updates

Installation

Prerequisites

Requirements

To install all required libraries, execute the pip command below.

pip install -r requirement.txt

Training

Input Features

The Kinetics I3D pre-trained feature of EGTEA dataset can be downloaded from GDrive link.
Files should be located in 'data/'.
You can get other features from the following links -

Config Files

The configuration files for EGTEA are already provided in the repository. For other datasets, they can be downloaded from GDrive link.

Training Model

To train the main HAT model, execute the command below.

python main.py --mode=train --split=[split #]*

*If the dataset has any splits (e.g., EGTEA has 4 splits)

To train the post-processing network (OSN), execute the commands below.

python supnet.py --mode=make --inference_subset=train --split=[split #]
python supnet.py --mode=make --inference_subset=test --split=[split #]
python supnet.py --mode=train --split=[split #]

Testing

To test HAT, execute the command below.

python main.py --mode=test --split=[split #]

Citing HAT

Please cite our paper in your publications if it helps your research:

@inproceedings{reza2022history,
  title={HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization},
  author={Reza, Sakib and Zhang, Yuexi and Moghaddam, Mohsen and Camps, Octavia},
  booktitle={European Conference on Computer Vision},
  pages={XXX--XXX},
  year={2024},
  organization={Springer}
}

Acknowledgment

This repository is created based on the repository of the baseline work OAT-OSN.