A flask application to predict the speech emotion of any .wav file.

Last update: Dec 15, 2021

Overview

This is a speech emotion recognition app. It will allow you to train a modular MLP model with the RAVDESS dataset, and then use that model with a flask application to predict the speech emotion of any .wav file.

REQS:

To download the RAVDESS speech emotion recognition data, go to: https://drive.google.com/file/d/1wWsrN2Ep7x6lWqOXfr4rpKGYrJhWc8z7/view

for installing all dependencie simply open terminal and run:

. ./install_deps.sh

This should create your venv and populate it with all necessary dependencies

MODEL:

A multilayer perceptron model to detect the emotion of wav files. To create and edit the model see create_model.py Once the create_model.py is adjusted to your liking (emotions_to_observe, and path to sound data), simply run:

python3 create_model.py

to create the model.model binary file and test accuracy of your model

APP:

Once the model.model binary is created, you can spin up the flask application (ToneCheck): To do so run

. ./start_flask.sh

The app will run default on localhost:5000, the emotions available for predictions will correspond with the emotions_to_observe variable you have edited inside create_models.py (and are therefore available inside the model binary file)

A flask application to predict the speech emotion of any .wav file.

Related tags

Overview

REQS:

MODEL:

APP:

Owner

Aryan Vijaywargia

UniSpeech - Large Scale Self-Supervised Learning for Speech

Twitter-Sentiment-Analysis - Analysis of twitter posts' positive and negative score.

Code for Discovering Topics in Long-tailed Corpora with Causal Intervention.

Pytorch implementation of Tacotron

Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".

BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents

FedNLP: A Benchmarking Framework for Federated Learning in Natural Language Processing

Training open neural machine translation models

Source code for CsiNet and CRNet using Fully Connected Layer-Shared feedback architecture.

The model is designed to train a single and large neural network in order to predict correct translation by reading the given sentence.

TFPNER: Exploration on the Named Entity Recognition of Token Fused with Part-of-Speech

nlp基础任务

Simple Python script to scrape youtube channles of "Parity Technologies and Web3 Foundation" and translate them to well-known braille language or any language

Code for paper "Which Training Methods for GANs do actually Converge? (ICML 2018)"

Sample data associated with the Aurora-BP study

DeLighT: Very Deep and Light-Weight Transformers

Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE

Community and sentiment analysis based on tweets

Intent parsing and slot filling in PyTorch with seq2seq + attention

A Flask Sentiment Analysis API, with visual implementation