Global-Local Attention for Emotion Recognition

Last update: Apr 21, 2022

Related tags

Overview

Global-Local Attention for Emotion Recognition

Requirements

Python 3
Install tensorflow (or tensorflow-gpu) >= 2.0.0
Install some other packages

pip install cython
pip install opencv-python==4.3.0.36 matplotlib numpy==1.18.5 dlib

Dataset

We provide the NCAER-S dataset with original images and extracted faces (a .txt file with 4 bounding box coordinate) in the NCAERS dataset.

The dataset can be downloaded at Google Drive

Note that the dataset and label should have structure like the followings:

NCAER-S 
│
└───images
│   │
│   └───class_1
│   │   │   img1.jpg
│   │   │   img2.jpg
│   │   │   ...
│   └───class_2
│       │   img1.jpg
│       │   img2.jpg
│       │   ...
│   
└───crop
│   │
│   └───class_1
│   │   │   img1.txt
│   │   │   img2.txt
│   │   │   ...
│   └───class_2
│       │   img1.txt
│       │   img2.txt
│       │   ...

Running

Our code supports these types of execution with argument -m or --mode:

#extract faces from <train, val or test> dataset (specified in config.py)
python run.py -m extract dataset_type=train

#train the model with config specified in the config.py
python run.py -m train 

#evaluate the trained model on the dataset <dataset_type>
python run.py -m eval --dataset_type=test --trained_weights=path/to/weights

Evaluation

Our trained model is available at weights/glamor-net/Model.

Firstly, please download the dataset and extract it into "data/" directory.
Then specified the path to the test data (images and crop):

config = config.copy({
    'test_images': 'path_to_test_images',
    'test_crop':   'path_to_test_cropped_faces' #(.txt files),
})

Run this command to evaluate the model. We are using the classification accuracy as our evaluation metric.

# Evaluate our model in the test set
python run.py -m eval --dataset_type=test --trained_weights=weights/glamor-net/Model

Training

Firstly please extract the faces from train set (val set is optional)

Specify the path to the dataset in config.py (train_images, val_images, test_images)
Specify the desired face-extracted output path in config.py (train_crop, val_crop, test_crop)

config = config.copy({

    'train_images': 'path_to_training_images',
    'train_crop':   'path_to_training_cropped_faces' #(.txt files),

    'val_images': 'path_to_validation_images',
    'val_crop':   'path_to_validation_cropped_faces' #(.txt files)

})

Perform face extraction on both dataset_type by running the commands:

python run.py -m extract --dataset_type=<train, val or test>

Start training:

# Train a new model from sratch
python run.py -m train 

# Continue training a model that you had trained earlier
python run.py -m train --resume=path/to/trained_weights

# Resume the last checkpoint model
python run.py -m train --resume=last

Prediction

We support prediction on single image or on images in a directory by running this command:

# Predict on single image
python predict.py --trained_weights=weights/glamor-net/Model --input=test_images/1.jpg --output=path/to/out/directory

# Predict on images in directory
python predict.py --trained_weights=weights/glamor-net/Model --input=test_images/ --output=out/

Use the help option to see a description of all available command line arguments

Global-Local Attention for Emotion Recognition

Related tags

Overview

Global-Local Attention for Emotion Recognition

Requirements

Dataset

Running

Evaluation

Training

Prediction

Use the help option to see a description of all available command line arguments

Owner

Minh Nhat Le

Reimplementation of the paper `Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words? (ACL2020)`

Official code for Next Check-ins Prediction via History and Friendship on Location-Based Social Networks (MDM 2018)

Diffusion Normalizing Flow (DiffFlow) Neurips2021

A python implementation of Yolov5 to detect fire or smoke in the wild in Jetson Xavier nx and Jetson nano

U-Net: Convolutional Networks for Biomedical Image Segmentation

JDet is Object Detection Framework based on Jittor.

The implementation of 'Image synthesis via semantic composition'.

这是一个deeplabv3-plus-pytorch的源码，可以用于训练自己的模型。

A framework for joint super-resolution and image synthesis, without requiring real training data

People log into different sites every day to get information and browse through these sites one by one

This porject is intented to build the most accurate model for predicting the porbability of loan default

A Python library for Deep Probabilistic Modeling

LF-YOLO (Lighter and Faster YOLO) is used to detect defect of X-ray weld image.

Production First and Production Ready End-to-End Speech Recognition Toolkit

Deep Surface Reconstruction from Point Clouds with Visibility Information

TensorFlow-based implementation of "ICNet for Real-Time Semantic Segmentation on High-Resolution Images".

Si Adek Keras is software VR dangerous object detection.

Nested cross-validation is necessary to avoid biased model performance in embedded feature selection in high-dimensional data with tiny sample sizes

The implementation of the paper "A Deep Feature Aggregation Network for Accurate Indoor Camera Localization".

这是一个利用facenet和retinaface实现人脸识别的库，可以进行在线的人脸识别。