Awesome Audio-Visual

A curated list of papers and datasets for various audio-visual tasks, inspired by awesome-computer-vision.

Contents

Audio-Visual Localization

Audio-Visual Separation

Audio-Visual Representation/Classification/Retrieval

Audio-Visual Action Recognition

Audio-Visual Spatial/Depth

Audio-Visual RIR

Audio-Visual Highlight Detection

Audio-Visual Deepfake/Robustness

Lightweight Audio-Visual Model

Audio-Visual Navigation/RL

Audio-Visual Faces/Speech

Audio-Visual Learning of Scene Acoustics

Audio-Visual Question Answering

Cross-modal Generation (Audio-Video / Video-Audio)

Audio-Visual Stylization/Generation

Multi-modal Architectures

Uncategorized Papers

Datasets

General Audio-Visual Tasks

Face-Voice Dataset

Licenses

CC0

To the extent possible under law, Kranti Kumar Parida has waived all copyright and related or neighboring rights to this work.

Contributing

Please feel free to send me a pull request or an email (kranti@cse.iitk.ac.in) to add links, correct wrong ones, or report broken links.