EmoTag helps you train emotion detection model for Chinese audios

Last update: Sep 07, 2022

Overview

emoTag

emoTag helps you train emotion detection model for Chinese audios.

Environment

pip install -r requirement.txt

Data

We used Emotional Speech Dataset (ESD) for Speech Synthesis and Voice Conversion from HLT Singapore.

Train Emotion Classifier

Use this command to train a classifier. Adjust training setups in conf/logfbank_train-emo.json.

python train.py --config conf/logfbank_train-emo.json --name task_trial_1

Models and logs will be find in exp/.

usage: train.py [-h] [-c CONFIG] [-r RESUME] [-n NAME] [--lr LR] [--bs BS]
                [--train_utt2wav TRAIN_UTT2WAV] [--val_utt2wav VAL_UTT2WAV]
                [--blocks BLOCKS] [--optimizer OPTIMIZER]
                [--train_pad0 TRAIN_PAD0] [--devel_pad0 DEVEL_PAD0]
                [--pretrain PRETRAIN]

PyTorch Template

optional arguments:
  -h, --help            show this help message and exit
  -c CONFIG, --config CONFIG
                        config file path (default: None)
  -r RESUME, --resume RESUME
                        path to latest checkpoint (default: None)
  -n NAME, --name NAME
  --lr LR, --learning_rate LR
  --bs BS, --batch_size BS
  --train_utt2wav TRAIN_UTT2WAV
  --val_utt2wav VAL_UTT2WAV
  --blocks BLOCKS
  --optimizer OPTIMIZER
  --train_pad0 TRAIN_PAD0
  --devel_pad0 DEVEL_PAD0
  --pretrain PRETRAIN

Infer labels

python infer_label.py

Adjust the vad_file param and code if necessary to adapt to new tasks. infer_label.py adopted multiprocessing, increased cpu utilities rate and inference efficiency. See usage details below.

usage: infer_label.py [-h] [--vad_file VAD_FILE] [--model_dir MODEL_DIR]
                      [--output_dir OUTPUT_DIR]

parse model info

optional arguments:
  -h, --help            show this help message and exit
  --vad_file VAD_FILE
  --model_dir MODEL_DIR
  --output_dir OUTPUT_DIR

EmoTag helps you train emotion detection model for Chinese audios

Related tags

Overview

emoTag

Environment

Data

Train Emotion Classifier

Infer labels

Owner

_zza

A PyTorch implementation of SIN: Superpixel Interpolation Network

Example repository for custom C++/CUDA operators for TorchScript

optimization routines for hyperparameter tuning

A collection of semantic image segmentation models implemented in TensorFlow

Training code and evaluation benchmarks for the "Self-Supervised Policy Adaptation during Deployment" paper.

Pretrained models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet.

Wordle-solver - Wordle answer generation program in python

Effect of Different Encodings and Distance Functions on Quantum Instance-based Classifiers

Histocartography is a framework bringing together AI and Digital Pathology

Research on Tabular Deep Learning (Python package & papers)

Model that predicts the probability of a Twitter user being anti-vaccination.

Replication Package for AequeVox:Automated Fariness Testing for Speech Recognition Systems

Object-Centric Learning with Slot Attention

[CVPR'21] Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-view Transformation

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

Potato Disease Classification - Training, Rest APIs, and Frontend to test.

Provided is code that demonstrates the training and evaluation of the work presented in the paper: "On the Detection of Digital Face Manipulation" published in CVPR 2020.

Matching python environment code for Lux AI 2021 Kaggle competition, and a gym interface for RL models.

Graph Convolutional Networks in PyTorch

Tzer: TVM Implementation of "Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation (OOPSLA'22)“.