Home

Awesome

audiosetdl

Modules and scripts for downloading Google's AudioSet dataset, a dataset of ~2.1 million annotated segments from YouTube videos.

Setup

Running

As a single script

SLURM

This can be run as a batch of SLURM jobs

Examples

Examples can be found in the notebooks directory of this repository.

Cases where videos cannot be downloaded

Notes by David

Easiest way to do this is install sox via brew.

Install homebrew

/usr/bin/ruby -e "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/master/install)"

Install wget and audiosetdl

brew install wget

./setup.sh

Install sox

brew install sox --with-flac --with-lame --with-libao --with-libsndfile --with-libvorbis --with-opencore-amr --with-opusfile

utils.run_comand() needs to open POPEN with universal_newlines=True to enable string format otherwise json module will fail.

Youtube is mostly AAC format so it's not necessary to store in any higher quality format, but sox doesn't support AAC. Easiest compressed format to use with sox is flac. Maybe ogg or mp3 will work also.