Awesome
I3D models trained on Kinetics
Overview
This repository contains trained models reported in the paper "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset" by Joao Carreira and Andrew Zisserman.
Tensorflow code is from Deepmind's Kinetics-I3D.
Pytorch code is from Kinetics-I3D
Fine-tuning and Feature Extraction
These models were pretrained on imagenet and kinetics (see original repo) for details).
Something to say
You need to down load the checkpoint from the original repo
- default load the kinetics pre-trained model
- extract features in thumos14 validation and test dataset the extract way is segment the video at uniform interval.
frames | interval | video fps |
---|---|---|
<=15000 | 24 | 30 |
<=30000 | 48 | 30 |
>30000 | 96 | 30 |
In order to reduce the redundancy in frames, we choose to subsample the video to 10fps. A clip includes 48 frames, we sample 16 frames and send to the I3D network to extract [1,1024] features
Feature is generated after Mix_5c and avg_pool layer:
input -> output:
rgb: [1, 16, 224, 224, 3] -> [1024,]
flow:[1, 16, 224, 224, 2] -> [1024,]