BART aids transcription tasks by taking a source audio file and creating automatically repeated loops, allowing transcribers to listen to fragments multiple times.

Last update: Feb 04, 2022

Related tags

Audio bart

Overview

 ____   ____ ____  ______
|    \ /    |    \|      |
|  o  )  o  |  D  )      |
|     |     |    /|_|  |_|
|  O  |  _  |    \  |  |
|     |  |  |  .  \ |  |
|_____|__|__|__|\_| |__|

"Are you transcribing a conversation? BART can help!"

BART (Beyond Audio Replay Technology) aids transcription tasks by taking a source audio file and creating automatically repeated loops, allowing transcribers to listen to each fragment multiple times (with possible overlap between segments).

Installation

Make sure to clone this repository and install Python dependencies using:

pip install -r requirements.txt

BART relies on Pydub for processing audio. Pydub needs ffmpeg or libav to be available on your system. Check the Pydub documentation for details.

Installing ffmpeg on macOS

On my macOS system, I did the following.

Download ffmpeg and ffprobe binaries from: https://evermeet.cx/ffmpeg/
Unarchive the downloaded files and copy them into a directory on your $PATH, e.g.:

sudo cp ffmpeg /usr/local/bin
sudo cp ffprobe /usr/local/bin

Double check they are in your $PATH:

which ffmpeg || echo "ffmpeg not found"
which ffprobe || echo "ffprobe not found"
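
The same check can be done from Python before running BART. This is a minimal sketch using only the standard library; the helper name find_tools is hypothetical and is not part of BART.

```python
import shutil


def find_tools(names=("ffmpeg", "ffprobe")):
    """Map each tool name to its resolved path on $PATH, or None if missing.

    Hypothetical helper for illustration; BART itself does not ship it.
    """
    return {name: shutil.which(name) for name in names}


if __name__ == "__main__":
    for name, path in find_tools().items():
        print(f"{name}: {path or 'not found'}")
```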

Usage

BART is very simple and expects all input files in the input/ directory.

Moreover, BART is very naive. It always writes output files to the output/ directory with exactly the same name as the input file. If that file already exists, BART overwrites it without warning.
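
Because existing output is overwritten silently, it can be worth checking for a clash before a run. The sketch below only mirrors the naming rule described above (same file name, output/ directory); the function names are hypothetical and the code does not call into BART.

```python
from pathlib import Path

# Directory names taken from the description above.
INPUT_DIR = Path("input")
OUTPUT_DIR = Path("output")


def output_path_for(input_name):
    """Return the path BART is described as writing to for a given input file."""
    return OUTPUT_DIR / Path(input_name).name


def would_overwrite(input_name):
    """True if a file with the same name already exists in output/."""
    return output_path_for(input_name).exists()


if __name__ == "__main__":
    name = "test.mp3"
    if would_overwrite(name):
        print(f"Warning: {output_path_for(name)} exists and would be overwritten")
    else:
        print(f"{output_path_for(name)} does not exist yet")
```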

BART is very chatty and will tell you when he adds a segment and when he adds a pause.

Example

python main.py --input test.mp3 --segment-length 10 --pause-length 1.0 --segment-step 5 --segment-repeat 2
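
To get a feel for what these flags produce, the sketch below computes a playback plan under one plausible reading of them: windows of --segment-length seconds start every --segment-step seconds, each window is played --segment-repeat times, and every play is followed by a pause of --pause-length seconds. This interpretation, the function names, and the 20-second file duration are assumptions for illustration, not BART's confirmed behavior.

```python
def plan_segments(duration, segment_length, segment_step, segment_repeat):
    """Return a list of (start, end) windows in seconds, in playback order.

    Assumed semantics: overlapping windows when segment_step < segment_length,
    with the last window clipped to the end of the file.
    """
    if segment_length <= 0 or segment_step <= 0 or segment_repeat < 1:
        raise ValueError("lengths must be positive and repeat at least 1")
    plan = []
    start = 0.0
    while start < duration:
        end = min(start + segment_length, duration)
        plan.extend([(start, end)] * segment_repeat)
        if end >= duration:
            break  # the file's end has been reached
        start += segment_step
    return plan


def total_length(plan, pause_length):
    """Total output duration, assuming one pause after every played window."""
    return sum(end - start for start, end in plan) + pause_length * len(plan)


if __name__ == "__main__":
    # Flags from the example command, applied to a hypothetical 20-second file.
    plan = plan_segments(20, segment_length=10, segment_step=5, segment_repeat=2)
    for start, end in plan:
        print(f"play {start:.1f}s to {end:.1f}s, then pause")
    print(f"total: {total_length(plan, pause_length=1.0):.1f}s")
```

With a step smaller than the segment length, consecutive windows overlap, so under this reading each stretch of audio is heard in more than one window.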

To see all available options:

python main.py -h

