AutoAD Project
- AutoAD III: The Prequel – Back to the Pixels [CVPR'24]. T. Han, M. Bain, A. Nagrani, G. Varol, W. Xie and A. Zisserman. [PDF]
- AutoAD II: The Sequel – Who, When, and What in Movie Audio Description [ICCV'23]. T. Han, M. Bain, A. Nagrani, G. Varol, W. Xie and A. Zisserman. [PDF]
- AutoAD I: Movie Description in Context [CVPR'23 Highlight]. T. Han*, M. Bain*, A. Nagrani, G. Varol, W. Xie and A. Zisserman. [PDF]
[project page]
News :mega:
- 2024.04.22: AutoAD-III paper released. Model weights and example AD outputs are available here. More code and datasets are coming soon.
<img src="asset/v3_figure.jpg" width="600">
Details
Reference
@InProceedings{han2024autoad3,
title={{AutoAD III: The Prequel} - Back to the Pixels},
author={Tengda Han and Max Bain and Arsha Nagrani and G\"ul Varol and Weidi Xie and Andrew Zisserman},
booktitle={CVPR},
year={2024}}
@InProceedings{han2023autoad2,
title={{AutoAD II: The Sequel} - Who, When, and What in Movie Audio Description},
author={Tengda Han and Max Bain and Arsha Nagrani and G\"ul Varol and Weidi Xie and Andrew Zisserman},
booktitle={ICCV},
year={2023}}
@InProceedings{han2023autoad1,
title={{AutoAD}: Movie Description in Context},
author={Tengda Han and Max Bain and Arsha Nagrani and G\"ul Varol and Weidi Xie and Andrew Zisserman},
booktitle={CVPR},
year={2023}}