Speech recognition is an important feature in several applications, such as home automation and artificial intelligence

Last update: Feb 13, 2022

Overview

Speech recognition is an important feature in applications such as home automation and artificial intelligence. This article introduces the SpeechRecognition and pyttsx3 libraries for Python.

Installation required:

Python SpeechRecognition module:
    pip install speechrecognition

PyAudio: Linux users can use the following command:
    sudo apt-get install python3-pyaudio
Windows users can install PyAudio by executing the following command in a terminal:
    pip install pyaudio

Python pyttsx3 module:
    pip install pyttsx3

Speech Input Using a Microphone and Translation of Speech to Text

Allow Adjusting for Ambient Noise: Since the surrounding noise varies, we must give the program a second or two to adjust the energy threshold of the recording so that it matches the external noise level.

Speech to text translation: This is done with the help of Google Speech Recognition, which requires an active internet connection to work. There are offline recognition systems such as PocketSphinx, but they have a rigorous installation process that requires several dependencies. Google Speech Recognition is one of the easiest to use.
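The two steps above (calibrating for ambient noise, then sending the recording to Google Speech Recognition) might look like the following minimal sketch. It assumes the SpeechRecognition package and PyAudio are installed and that a working microphone is available. The names transcribe_once and main and the one-second calibration default are illustrative choices, not part of the library.

```python
def transcribe_once(recognizer, source, calibration_seconds=1.0):
    """Calibrate for ambient noise, record one phrase, and return its text.

    `recognizer` is expected to behave like speech_recognition.Recognizer and
    `source` like an opened speech_recognition.Microphone. The function name
    and the one-second default are illustrative assumptions.
    """
    # Sample the background noise so the energy threshold matches the room.
    recognizer.adjust_for_ambient_noise(source, duration=calibration_seconds)
    # Block until a phrase has been spoken and a pause is detected.
    audio = recognizer.listen(source)
    # Send the audio to Google Speech Recognition (needs internet access).
    return recognizer.recognize_google(audio)


def main():
    # Imported here so transcribe_once can be exercised without a microphone.
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    try:
        with sr.Microphone() as source:
            print("Listening...")
            text = transcribe_once(recognizer, source)
        print("You said:", text)
    except sr.UnknownValueError:
        # Raised when the service cannot make sense of the audio.
        print("Could not understand the audio.")
    except sr.RequestError as error:
        # Raised when the service is unreachable, e.g. no internet connection.
        print("Could not reach the recognition service:", error)
```

Calling main() records a single phrase and prints the recognized text. Keeping the calibrate, listen, and recognize sequence in its own helper makes it possible to check the call order without audio hardware.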

Translation of Text to Speech

First, we need to import the pyttsx3 library and then initialize it using the init() function. This function may take 2 arguments.

init(driverName string, debug bool)
    driverName: name of an available driver (sapi5 on Windows, nsss on macOS)
    debug: enables or disables debug output

After initialization, we make the program speak the text using the say() function. This method may also take 2 arguments.

say(text unicode, name string)
    text: any text you wish to hear
    name: a name for this speech (optional)

Finally, to run the speech we use runAndWait(). Text queued with say() is not spoken until the interpreter reaches runAndWait().
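The init(), say(), and runAndWait() sequence described above can be sketched as follows. This is a minimal example that assumes pyttsx3 is installed and an audio output device is available. The names speak_all and main and the sample phrases are illustrative and not part of pyttsx3.

```python
def speak_all(engine, phrases):
    """Queue every phrase with say(), then speak them with one runAndWait().

    `engine` is expected to behave like the object returned by pyttsx3.init().
    The function name is an illustrative assumption.
    """
    for phrase in phrases:
        # say() only queues the text; nothing is audible yet.
        engine.say(phrase)
    # runAndWait() processes the queue and blocks until speech has finished.
    engine.runAndWait()


def main():
    # Imported here so speak_all can be exercised without an audio device.
    import pyttsx3

    # With no driverName, pyttsx3 selects the default driver for the platform.
    engine = pyttsx3.init()
    speak_all(engine, ["Hello.", "This text was converted to speech."])
```

Calling main() speaks both phrases in order. Because say() only queues text, several phrases can be added before a single runAndWait() call.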


Owner

RISHABH MISHRA
