Awesome

Mix and Localize: Localizing Sound Sources in Mixtures

<h4> Xixi Hu*, Ziyang Chen*,Andrew Owens </br> <span style="font-size: 14pt; color: #555555"> University of Michigan </span> </br> CVPR 2022 </h4> <hr>

This repository contains the official codebase for Mix and Localize: Localizing Sound Sources in Mixtures. [Project Page]

MUSIC Dataset

Download the MUSIC dataset here: MUSIC repo

Postprocess the MUSIC dataset and extract the frames and audio clips. The structure of the dataset folder is as follow.

data
  └──MUSIC 
  │    ├──data-splits
  │    ├──MUSIC_raw
  │           ├──duet
  │           ├──solo
  │                └── [class_label]
  │                         └── [ytid]
  │                               ├── audio
  │                               │      ├──audio_clips
  │                               │             ├── 00000.wav       # 1 second audio clips
  │                               │             ├── 00001.wav
  │                               │             ├── ...
  │                               └── frames
  │                                      ├── 00000.jpg              # fps = 4
  │                                      ├── ...

Training

python train.py