Awesome
Audio Mamba: Pretrained Audio Mamba for Audio Pattern Recognition
Introduction
The Code Repository for "Audio Mamba: Pretrained Audio Mamba for Audio Pattern Recognition"
Getting Started
Environments
The codebase is developed with pytorch == 1.8.1, torch-lightning == 1.5.9 Install requirements as follows:
pip install -r requirements.txt
Download and Processing Datasets
- config.py
change the varible "dataset_path" to your audioset address
change the variable "desed_folder" to your DESED address
change the classes_num to 527
./create_index.sh #
// remember to change the pathes in the script
// more information about this script is in https://github.com/qiuqiangkong/audioset_tagging_cnn
python main.py save_idc
// count the number of samples in each class and save the npy files
Open the jupyter notebook at esc-50/prep_esc50.ipynb and process it
Open the jupyter notebook at scv2/prep_scv2.ipynb and process it
python conver_desed.py
// will produce the npy data files
Set the Configuration File: config.py
The script config.py contains all configurations you need to assign to run your code. Please read the introduction comments in the file and change your settings.
Training
TBD
Results
TBD