EmoTag helps you train emotion detection model for Chinese audios

Last update: Sep 07, 2022

Overview

emoTag

emoTag helps you train emotion detection model for Chinese audios.

Environment

pip install -r requirement.txt

Data

We used Emotional Speech Dataset (ESD) for Speech Synthesis and Voice Conversion from HLT Singapore.

Train Emotion Classifier

Use this command to train a classifier. Adjust training setups in conf/logfbank_train-emo.json.

python train.py --config conf/logfbank_train-emo.json --name task_trial_1

Models and logs will be find in exp/.

usage: train.py [-h] [-c CONFIG] [-r RESUME] [-n NAME] [--lr LR] [--bs BS]
                [--train_utt2wav TRAIN_UTT2WAV] [--val_utt2wav VAL_UTT2WAV]
                [--blocks BLOCKS] [--optimizer OPTIMIZER]
                [--train_pad0 TRAIN_PAD0] [--devel_pad0 DEVEL_PAD0]
                [--pretrain PRETRAIN]

PyTorch Template

optional arguments:
  -h, --help            show this help message and exit
  -c CONFIG, --config CONFIG
                        config file path (default: None)
  -r RESUME, --resume RESUME
                        path to latest checkpoint (default: None)
  -n NAME, --name NAME
  --lr LR, --learning_rate LR
  --bs BS, --batch_size BS
  --train_utt2wav TRAIN_UTT2WAV
  --val_utt2wav VAL_UTT2WAV
  --blocks BLOCKS
  --optimizer OPTIMIZER
  --train_pad0 TRAIN_PAD0
  --devel_pad0 DEVEL_PAD0
  --pretrain PRETRAIN

Infer labels

python infer_label.py

Adjust the vad_file param and code if necessary to adapt to new tasks. infer_label.py adopted multiprocessing, increased cpu utilities rate and inference efficiency. See usage details below.

usage: infer_label.py [-h] [--vad_file VAD_FILE] [--model_dir MODEL_DIR]
                      [--output_dir OUTPUT_DIR]

parse model info

optional arguments:
  -h, --help            show this help message and exit
  --vad_file VAD_FILE
  --model_dir MODEL_DIR
  --output_dir OUTPUT_DIR

EmoTag helps you train emotion detection model for Chinese audios

Related tags

Overview

emoTag

Environment

Data

Train Emotion Classifier

Infer labels

Owner

_zza

Implementation of SE3-Transformers for Equivariant Self-Attention, in Pytorch.

Allele-specific pipeline for unbiased read mapping(WIP), QTL discovery(WIP), and allelic-imbalance analysis

Collect some papers about transformer with vision. Awesome Transformer with Computer Vision (CV)

Implementation of Diverse Semantic Image Synthesis via Probability Distribution Modeling

An implementation of Deep Forest 2021.2.1.

Neural Scene Flow Fields using pytorch-lightning, with potential improvements

Image Processing, Image Smoothing, Edge Detection and Transforms

End-to-end speech secognition toolkit

SporeAgent: Reinforced Scene-level Plausibility for Object Pose Refinement

Object-Centric Learning with Slot Attention

PyTorch implementation of "Optimization Planning for 3D ConvNets"

A tool to prepare websites grabbed with wget for local viewing.

PyTorch for Semantic Segmentation

A Machine Teaching Framework for Scalable Recognition

The project covers common metrics for super-resolution performance evaluation.

A curated list of awesome game datasets, and tools to artificial intelligence in games

An educational AI robot based on NVIDIA Jetson Nano.

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

IDRLnet, a Python toolbox for modeling and solving problems through Physics-Informed Neural Network (PINN) systematically.

(ICCV'21) Official PyTorch implementation of Relational Embedding for Few-Shot Classification