This project converts human voice input into a text transcript and into an automated voice.

Last update: Oct 15, 2021

Overview

Human Voice to Automated Voice & Text

Introduction:

In this project, whenever you speak, your voice is converted into a robotic voice, and a transcript of the input is generated as well.

Requirements:

In order for this project to run, you need to have the following two modules installed:

gTTS (pip install gTTS)
SpeechRecognition (pip install SpeechRecognition)

SpeechRecognition:

The SpeechRecognition module is used to create a new microphone instance and to perform speech recognition on the human voice coming from that microphone. It converts the input voice into its corresponding text, which can also be described as creating a transcript of the input. If you want to take voice input straight from the microphone, as this project does, you also need to install PyAudio version 0.2.11 or later.
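The step described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function name `transcribe_from_microphone`, the `language` default, and the choice of the `recognize_google` backend are assumptions made for the example.

```python
def transcribe_from_microphone(language="en-US"):
    """Record one phrase from the default microphone and return its transcript.

    Hypothetical helper for illustration; returns None if the speech
    could not be understood.
    """
    # Imported here so that loading this sketch does not require the
    # package (or PyAudio) to be installed.
    import speech_recognition as sr

    recognizer = sr.Recognizer()

    # sr.Microphone() needs PyAudio to access the audio device.
    with sr.Microphone() as source:
        # Sample background noise briefly so quiet speech is detected better.
        recognizer.adjust_for_ambient_noise(source, duration=1)
        audio = recognizer.listen(source)

    try:
        # Assumption: the free Google Web Speech backend is used here;
        # it needs an internet connection.
        return recognizer.recognize_google(audio, language=language)
    except sr.UnknownValueError:
        # The audio was captured but no words could be recognised.
        return None
    except sr.RequestError as exc:
        raise RuntimeError(f"Speech recognition service unavailable: {exc}") from exc
```

Calling `transcribe_from_microphone()` blocks until a phrase has been spoken and then returns the recognised text, or `None` when nothing intelligible was heard.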

gTTS (Google Text To Speech):

gTTS is a Python library that interfaces with Google Translate's text-to-speech API. In this project, it is used to convert the transcript of the input (obtained from the SpeechRecognition module) into a robotic voice and then save the result as an MP3 file.
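A minimal sketch of this conversion step is shown below. It is an illustration under stated assumptions rather than the project's own code: the function name `text_to_mp3`, the default file name `output.mp3`, and the `language` default are hypothetical.

```python
def text_to_mp3(text, output_path="output.mp3", language="en"):
    """Convert a transcript into synthesized speech and save it as an MP3.

    Hypothetical helper for illustration; returns the path that was written.
    """
    # gTTS cannot synthesize empty input, so fail early with a clear message.
    if not text or not text.strip():
        raise ValueError("Cannot synthesize speech from an empty transcript.")

    # Imported here so that loading this sketch does not require the package.
    from gtts import gTTS

    # Build the request; the audio is fetched over the network when saving.
    speech = gTTS(text=text, lang=language)
    speech.save(output_path)
    return output_path
```

For example, `text_to_mp3("Hello from the robot voice")` would write `output.mp3` to the current directory, provided an internet connection is available.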

Note:

All the files have been commented for your ease. Feel free to add further comments of your own.

Contact Info:

For further queries, contact me at: [email protected]


Owner

Hassan Shahzad
