Terminal-based audio-to-text converter

Last update: Dec 15, 2022

Overview

att

Terminal-based audio-to-text converter

Project description

A terminal-based audio-to-text converter written in python, enabling you to convert .wav files or microphone input into text and save it to a file.

Requirements

To run the main python modules att_wav.py and mtt.py, you need to install the following packages:

speech_recognition
pydub
time
pyaudio

The installation method depends on the environment/ package manager you are using. The following examples show the installation of pydub for a standard python environment with pip and for an Anaconda environment via conda.

pip install pydub

conda install -c conda-forge pydub

License

This code is licensed under GPL-3.0 License.

Usage

To convert an audio file to text, start a terminal session, navigate to the location of the required module (e.g. att_wav.py) and start a python shell running the code by typing python att_wav.py.

Note that the att_wav.py can only handle .wav files due to the implementation of the underlying speech recognition API.

Hardware & Software Requirements

These programs can be run without much computing power. They can be executed on any modern device fullfilling minimal RAM/ CPU standards.

Terminal-based audio-to-text converter

Related tags

Overview

att

Project description

Requirements

License

Usage

Hardware & Software Requirements

Owner

Sven Eschlbeck

Analyze, visualize and process sound field data recorded by spherical microphone arrays.

Music bot of # Owner

Delta TTA(Text To Audio) SoftWare

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Audio fingerprinting and recognition in Python

SomaFM Plugin for Kodi

Free and Open Source Channel/Group Voice chat music player for telegram with button support saavn playback support.

Converting UGG files from Rode Wireless Go II transmitters (unsompressed recordings) to WAV format

DeepMusic is an easy to use Spotify like app to manage and listen to your favorites musics.

Omniscient Mozart, being able to transcribe everything in the music, including vocal, drum, chord, beat, instruments, and more.

User-friendly Voice Cloning Application

Music player - endlessly plays your music

無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXのコア

OpenClubhouse - A third-part web application based on flask to play Clubhouse audio.

Speech Algorithms Collections

A voice control utility for Spotify

convert-to-opus-cli is a Python CLI program for converting audio files to opus audio format.

Audio augmentations library for PyTorch for audio in the time-domain

Frescobaldi LilyPond Editor

An 8D music player made to enjoy Halloween this year!🤘