License: CC BY-NC 4.0

<br/> <h1 align="center">Silero VAD</h1> <br/>

Silero VAD is a pre-trained, enterprise-grade Voice Activity Detector (see also our STT models).

<br/> <p align="center"> <img src="https://github.com/snakers4/silero-vad/assets/36505480/300bd062-4da5-4f19-9736-9c144a45d7a7" /> </p> <details> <summary>Real Time Example</summary>

https://user-images.githubusercontent.com/36505480/144874384-95f80f6d-a4f1-42cc-9be7-004c891dd481.mp4

Please note that the video loads only if you are logged in to your GitHub account.

</details> <br/> <h2 align="center">Fast start</h2> <br/> <details> <summary>Dependencies</summary>

System requirements to run the Python examples on x86-64 systems:

Dependencies:

Silero VAD uses the torchaudio library for audio I/O (`torchaudio.info`, `torchaudio.load`, and `torchaudio.save`), so a proper audio backend is required:

If you plan to run the VAD solely with onnx-runtime, it will run on any other system architecture where onnx-runtime is supported. In this case, please note that:

</details>

Using pip: `pip install silero-vad`

```python
from silero_vad import load_silero_vad, read_audio, get_speech_timestamps

model = load_silero_vad()
wav = read_audio('path_to_audio_file')
speech_timestamps = get_speech_timestamps(
  wav,
  model,
  return_seconds=True,  # Return speech timestamps in seconds (default is samples)
)
```
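
The call above returns a list of detected speech segments. As a minimal sketch, assuming each segment is a dict with `start` and `end` keys, the total amount of detected speech could be summed as follows. The helper `total_speech_seconds` and the sample values are hypothetical and only illustrate the shape of the data; they are not part of silero-vad.

```python
def total_speech_seconds(speech_timestamps):
    """Sum segment durations; expects dicts with 'start' and 'end' in seconds."""
    return sum(seg["end"] - seg["start"] for seg in speech_timestamps)


# Illustrative values only, shaped like the output with return_seconds=True.
example_timestamps = [
    {"start": 0.5, "end": 2.0},
    {"start": 3.0, "end": 4.5},
]

print(total_speech_seconds(example_timestamps))
```

Keeping this kind of post-processing in plain Python means it works the same way whether the timestamps come from the pip package or from `torch.hub`.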

Using torch.hub:

```python
import torch
torch.set_num_threads(1)

model, utils = torch.hub.load(repo_or_dir='snakers4/silero-vad', model='silero_vad')
(get_speech_timestamps, _, read_audio, _, _) = utils

wav = read_audio('path_to_audio_file')
speech_timestamps = get_speech_timestamps(
  wav,
  model,
  return_seconds=True,  # Return speech timestamps in seconds (default is samples)
)
```
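
As the comment in the snippet notes, timestamps are expressed in samples unless `return_seconds=True` is passed. The sketch below shows one way to convert sample-based segments to seconds by hand. It assumes the audio was loaded at 16 kHz and that segments are dicts with `start` and `end` keys; `samples_to_seconds` and the example values are hypothetical, not part of the library.

```python
SAMPLING_RATE = 16000  # assumption: the audio was loaded at 16 kHz


def samples_to_seconds(speech_timestamps, sampling_rate=SAMPLING_RATE):
    """Convert 'start'/'end' sample indices to seconds."""
    return [
        {
            "start": seg["start"] / sampling_rate,
            "end": seg["end"] / sampling_rate,
        }
        for seg in speech_timestamps
    ]


# Illustrative values only, shaped like the default (sample-based) output.
example_samples = [
    {"start": 8000, "end": 32000},
    {"start": 48000, "end": 56000},
]

print(samples_to_seconds(example_samples))
```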
<br/> <h2 align="center">Key Features</h2> <br/> <br/> <h2 align="center">Typical Use Cases</h2> <br/> <br/> <h2 align="center">Links</h2> <br/> <br/> <h2 align="center">Get In Touch</h2> <br/>

Try our models, create an issue, start a discussion, join our Telegram chat, email us, or read our news.

Please see our wiki for relevant information and email us directly.

Citations

@misc{Silero VAD,
  author = {Silero Team},
  title = {Silero VAD: pre-trained enterprise-grade Voice Activity Detector (VAD), Number Detector and Language Classifier},
  year = {2024},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/snakers4/silero-vad}},
  commit = {insert_some_commit_here},
  email = {hello@silero.ai}
}
<br/> <h2 align="center">Examples and VAD-based Community Apps</h2> <br/>