Home

Awesome

[ICCV2023] D3G:Exploring Gaussian Prior for Temporal Sentence Grounding with Glance Annotation

d3g

Datasets

Please download the video features to directory dataset as follows.

dataset
├── Charades_STA
│   ├── vgg_rgb_features.hdf5
│   ├── glance_charades_train.json
│   ├── charades_test.json
├── ActivityNet
│   ├── sub_activitynet_v1-3.c3d.hdf5
│   ├── glance_train.json
│   ├── val.json
│   ├── test.json
├── TACoS
│   ├── tall_c3d_features.hdf5
│   ├── glance_train.json
│   ├── val.json
│   ├── test.json

Main Results

Charades-STA Dataset

MethodRank1@0.5Rank1@0.7Rank5@0.5Rank5@0.7
ViGA36.5616.1048.9025.86
D3G41.6419.6079.2549.30

ActivityNet Captions Dataset

MethodRank1@0.3Rank1@0.5Rank1@0.7Rank5@0.3Rank5@0.5Rank5@0.7
ViGA59.7835.3916.2572.1953.1932.69
D3G58.2536.6818.5487.8474.2152.47

TACoS Dataset

MethodRank1@0.3Rank1@0.5Rank1@0.7Rank5@0.3Rank5@0.5Rank5@0.7
ViGA20.829.523.1027.9215.356.10
D3G26.9912.624.7754.7131.5912.10

Training & Inference

cd scipts 
### charades 
sh charades_train.sh # train 
sh charades_test.sh # test 

### activitynet
sh anet_train.sh # train 
sh anet_test.sh # test 

### tacos 
sh tacos_train.sh  # train 
sh tacos_test.sh # test