Home

Awesome

Mailing list : test Mailing list : test License: CC BY-NC 4.0

Open In Colab

header

<br/> <h1 align="center">Silero VAD</h1> <br/>

Silero VAD - pre-trained enterprise-grade Voice Activity Detector (also see our STT models).

<br/> <p align="center"> <img src="https://github.com/snakers4/silero-vad/assets/36505480/300bd062-4da5-4f19-9736-9c144a45d7a7" /> </p> <details> <summary>Real Time Example</summary>

https://user-images.githubusercontent.com/36505480/144874384-95f80f6d-a4f1-42cc-9be7-004c891dd481.mp4

</details> <br/> <h2 align="center">Fast start</h2> <br/>

Using pip: pip install silero-vad

from silero_vad import load_silero_vad, read_audio, get_speech_timestamps
model = load_silero_vad()
wav = read_audio('path_to_audio_file') # backend (sox, soundfile, or ffmpeg) required!
speech_timestamps = get_speech_timestamps(wav, model)

Using torch.hub:

import torch
torch.set_num_threads(1)

model, utils = torch.hub.load(repo_or_dir='snakers4/silero-vad', model='silero_vad')
(get_speech_timestamps, _, read_audio, _, _) = utils

wav = read_audio('path_to_audio_file') # backend (sox, soundfile, or ffmpeg) required!
speech_timestamps = get_speech_timestamps(wav, model)
<br/> <h2 align="center">Key Features</h2> <br/> <br/> <h2 align="center">Typical Use Cases</h2> <br/> <br/> <h2 align="center">Links</h2> <br/> <br/> <h2 align="center">Get In Touch</h2> <br/>

Try our models, create an issue, start a discussion, join our telegram chat, email us, read our news.

Please see our wiki for relevant information and email us directly.

Citations

@misc{Silero VAD,
  author = {Silero Team},
  title = {Silero VAD: pre-trained enterprise-grade Voice Activity Detector (VAD), Number Detector and Language Classifier},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/snakers4/silero-vad}},
  commit = {insert_some_commit_here},
  email = {hello@silero.ai}
}
<br/> <h2 align="center">Examples and VAD-based Community Apps</h2> <br/>