wsj0-mix generation scripts, in Python

The generate_wsjmix.py script reproduces the original Matlab generation scripts for 2 and 3 sources (around 70 dB SNR between the Matlab and Python output files), and also works for 4 and 5 sources, as introduced here.
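As a rough illustration of what an SNR figure like the one above measures, here is a minimal sketch that compares a reference signal with a reproduction of it. It assumes the usual definition, 10·log10(reference energy / error energy). This README does not say how the 70 dB figure was computed, and the function name `snr_db` is hypothetical, not part of generate_wsjmix.py.

```python
import math


def snr_db(reference, estimate):
    """Return the SNR in dB of `estimate` measured against `reference`.

    Both arguments are equal-length sequences of samples. This uses the
    common definition (an assumption, not taken from the repository):
    10 * log10(sum(ref^2) / sum((ref - est)^2)).
    """
    if len(reference) != len(estimate):
        raise ValueError("signals must have the same length")

    signal_energy = sum(r * r for r in reference)
    error_energy = sum((r - e) ** 2 for r, e in zip(reference, estimate))

    if error_energy == 0:
        return math.inf  # identical signals: no error at all
    if signal_energy == 0:
        return -math.inf  # silent reference, non-zero error
    return 10.0 * math.log10(signal_energy / error_energy)


# Toy example: the reproduction differs from the reference by 0.1% per sample.
reference = [1.0, -1.0, 1.0, -1.0]
estimate = [0.999, -0.999, 0.999, -0.999]
print(f"{snr_db(reference, estimate):.1f} dB")
```

On real data, the two sequences would be the samples of the same mixture file as written by the Matlab and the Python scripts. A higher value means a closer match.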

Many thanks to @Jonathan-LeRoux for providing activlev outputs, verifying the match and debugging the code!

Usage

To generate wsj0-mix data, run the following (after git clone pywsj0-mix && cd pywsj0-mix):

python generate_wsjmix.py -p wsj0_path -o output_path -n n_sources -sr sr

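Where:

wsj0_path is the path to the WSJ0 data
output_path is the directory where the generated mixtures are written
n_sources is the number of sources per mixture (2, 3, 4 or 5)
sr is the sampling rate in Hz (8000 or 16000)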

To generate all the possible versions of wsj0-mix (around 330 GB), run:

for nsrc in 2 3 4 5; do
  for sr in 8000 16000; do
    python generate_wsjmix.py -p wsj0_path -o output_path -n $nsrc -sr $sr
  done
done