Python interface to the WebRTC Voice Activity Detector

Last update: Dec 22, 2022

Related tags

Audio py-webrtcvad

Overview

py-webrtcvad

This is a python interface to the WebRTC Voice Activity Detector (VAD). It is compatible with Python 2 and Python 3.

A VAD classifies a piece of audio data as being voiced or unvoiced. It can be useful for telephony and speech recognition.

The VAD that Google developed for the WebRTC project is reportedly one of the best available, being fast, modern and free.

How to use it

Install the webrtcvad module:
```
pip install webrtcvad
```
Create a Vad object:
```
import webrtcvad
vad = webrtcvad.Vad()
```
Optionally, set its aggressiveness mode, which is an integer between 0 and 3. 0 is the least aggressive about filtering out non-speech, 3 is the most aggressive. (You can also set the mode when you create the VAD, e.g. vad = webrtcvad.Vad(3)):
```
vad.set_mode(1)
```

Give it a short segment ("frame") of audio. The WebRTC VAD only accepts 16-bit mono PCM audio, sampled at 8000, 16000, 32000 or 48000 Hz. A frame must be either 10, 20, or 30 ms in duration:

# Run the VAD on 10 ms of silence. The result should be False.
sample_rate = 16000
frame_duration = 10  # ms
frame = b'\x00\x00' * int(sample_rate * frame_duration / 1000)
print 'Contains speech: %s' % (vad.is_speech(frame, sample_rate)

See example.py for a more detailed example that will process a .wav file, find the voiced segments, and write each one as a separate .wav.

How to run unit tests

To run unit tests:

pip install -e ".[dev]"
python setup.py test

History

2.0.10

Fixed memory leak. Thank you, bond005!

2.0.9

Improved example code. Added WebRTC license.

2.0.8

Fixed Windows compilation errors. Thank you, xiongyihui!

Python interface to the WebRTC Voice Activity Detector

Related tags

Overview

py-webrtcvad

How to use it

How to run unit tests

History

Owner

John Wiseman

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

PianoPlayer - Automatic fingering generator for piano scores

spafe: Simplified Python Audio-Features Extraction

Algorithmic Multi-Instrumental MIDI Continuation Implementation

Scrap electronic music charts into CSV files

Music Streaming Platform based on full implementation of DBSM

A simple music player, powered by Python, utilising various libraries such as Tkinter and Pygame

Pythonic bindings for FFmpeg's libraries.

Free and Open Source Channel/Group Voice chat music player for telegram with button support saavn playback support.

Vixtify - Python Controlled Music Player

voice assistant made with python that search for covid19 data(like total cases, deaths and etc) in a specific country

This bot can stream audio or video files and urls in telegram voice chats

We built this fully functioning Music player in Python. The music player allows you to play/pause and switch to different songs easily.

Python tools for the corpus analysis of popular music.

A music player designed for a University Project.

Anki vector Music ❤ is the best and only Telegram VC player with playlists, Multi Playback, Channel play and more

eyeD3 is a Python module and command line program for processing ID3 tags. Information about mp3 files (i.e bit rate, sample frequency, play time, etc.) is also provided. The formats supported are ID3v1 (1.0/1.1) and ID3v2 (2.3/2.4).

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

python script for getting mp3 files from yaoutube playlist

Synthesia but open source, made in python and free

Python interface to the WebRTC Voice Activity Detector

Related tags

Overview

py-webrtcvad

How to use it

How to run unit tests

History

Owner

John Wiseman

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

PianoPlayer - Automatic fingering generator for piano scores

spafe: Simplified Python Audio-Features Extraction

Algorithmic Multi-Instrumental MIDI Continuation Implementation

Scrap electronic music charts into CSV files

Music Streaming Platform based on full implementation of DBSM

A simple music player, powered by Python, utilising various libraries such as Tkinter and Pygame

﻿﻿Pythonic bindings for FFmpeg's libraries.

Free and Open Source Channel/Group Voice chat music player for telegram with button support saavn playback support.

Vixtify - Python Controlled Music Player

voice assistant made with python that search for covid19 data(like total cases, deaths and etc) in a specific country

This bot can stream audio or video files and urls in telegram voice chats

We built this fully functioning Music player in Python. The music player allows you to play/pause and switch to different songs easily.

Python tools for the corpus analysis of popular music.

A music player designed for a University Project.

Anki vector Music ❤ is the best and only Telegram VC player with playlists, Multi Playback, Channel play and more

eyeD3 is a Python module and command line program for processing ID3 tags. Information about mp3 files (i.e bit rate, sample frequency, play time, etc.) is also provided. The formats supported are ID3v1 (1.0/1.1) and ID3v2 (2.3/2.4).

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

python script for getting mp3 files from yaoutube playlist

Synthesia but open source, made in python and free

Pythonic bindings for FFmpeg's libraries.