Speech-Mamba

This repository contains the implementation of the SLT paper "Speech-Mamba: Long-Context Speech Recognition with Selective State Spaces Models".

Set Up

Our experiments are built on the SpeechBrain toolkit. To install SpeechBrain, run the following commands:

git clone https://github.com/speechbrain/speechbrain.git
cd speechbrain
pip install -r requirements.txt
pip install --editable .

Please also install PyTorch, along with the following dependencies:

pip install sox librosa soundfile
pip install ctc-segmentation
pip install kaldilm
pip install mamba-ssm
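
Before training, it can help to confirm that the packages above are visible to the Python interpreter you will use. The following is a minimal sketch, not part of this repository: the helper name `missing_modules` is hypothetical, and the import names in `REQUIRED_MODULES` are assumptions derived from the pip package names (for example `mamba_ssm` for `mamba-ssm`), which may differ in your environment.

```python
from importlib.util import find_spec

# Import names are assumptions; they may differ from the pip package names.
REQUIRED_MODULES = [
    "torch",
    "speechbrain",
    "sox",
    "librosa",
    "soundfile",
    "ctc_segmentation",
    "kaldilm",
    "mamba_ssm",
]


def missing_modules(names):
    """Return the module names that cannot be located in this environment."""
    missing = []
    for name in names:
        try:
            if find_spec(name) is None:
                missing.append(name)
        except (ImportError, ValueError):
            # Treat a module that cannot be resolved as missing.
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All listed modules were found.")
```

The check only locates the modules without importing them, so it does not verify that `mamba-ssm` was built against a compatible CUDA and PyTorch version.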

Dataset

You can download LibriSpeech at http://www.openslr.org/12
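
After extracting the archives, a quick check of the folder layout can catch an incomplete download early. The sketch below assumes the standard LibriSpeech subset folders all sit under one root directory; the helper name `missing_splits` and the example path are hypothetical, and which subsets the recipe actually needs depends on the hparams file, so adjust the list accordingly.

```python
from pathlib import Path

# Standard LibriSpeech subset folder names as distributed on OpenSLR.
LIBRISPEECH_SPLITS = [
    "train-clean-100",
    "train-clean-360",
    "train-other-500",
    "dev-clean",
    "dev-other",
    "test-clean",
    "test-other",
]


def missing_splits(data_folder, splits=LIBRISPEECH_SPLITS):
    """Return the subset folders that are not present under data_folder."""
    root = Path(data_folder)
    return [name for name in splits if not (root / name).is_dir()]


if __name__ == "__main__":
    # Hypothetical location; replace with the folder you extracted into.
    absent = missing_splits("./LibriSpeech")
    if absent:
        print("Missing subsets:", ", ".join(absent))
    else:
        print("All listed subsets are present.")
```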

Train

cd ./speechbrain/recipes/LibriSpeech/ASR/transformer

python train.py hparams/mamba.yaml

Test

python train.py hparams/mamba_test.yaml --test_only

Citation

Please cite the following paper:

@article{gao2024speech,
  title={Speech-Mamba: Long-Context Speech Recognition with Selective State Spaces Models},
  author={Gao, Xiaoxue and Chen, Nancy F},
  journal={arXiv preprint arXiv:2409.18654},
  year={2024}
}