Multilingual Emotion classification using BERT (fine-tuning). Published at the WASSA workshop (ACL2022).

Last update: Sep 17, 2022

Overview

XLM-EMO: Multilingual Emotion Prediction in Social Media Text

Abstract

Detecting emotion in text allows social and computational scientists to study how people behave and react to online events. However, developing these tools for different languages requires data that is not always available. This paper collects the available emotion detection datasets across 19 languages. We train a multilingual emotion prediction model for social media data, XLM-EMO. The model shows competitive performance in a zero-shot setting, suggesting it is helpful in the context of low-resource languages. We release our model to the community so that interested researchers can directly use it.

See the paper for additional details:

Bianchi, F., Nozza, & D., Hovy. "XLM-EMO: Multilingual Emotion Prediction in Social Media Text". In Proceedings of the 12th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (Forthcoming). Association for Computational Linguistics, 2022. Link.

Free software: MIT license

Installing

pip install -U xlm-emo

Important: If you want to use CUDA you need to install the correct version of the CUDA systems that matches your distribution, see PyTorch.

Features

from xlm_emo.classifier import  EmotionClassifier
ec = EmotionClassifier()

ec.predict(["senti testa di cazzo", "I am very happy"])

>> ["anger", "joy"]

Models

Model	Link	Macro F1 on Test Set
XLM-EMO-T	https://huggingface.co/MilaNLProc/xlm-emo-t	0.85
XLM-EMO-B	TBD	TBD
XLM-EMO-L	TBD	TBD

Reference

If you use this tool please cite the following paper:

@inproceedings{bianchi-etal-2022-xlmemo,
title = {{XLM-EMO}: Multilingual Emotion Prediction in Social Media Text},
author = "Bianchi, Federico and Nozza, Debora and Hovy, Dirk",
booktitle = "Proceedings of the 12th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis",
year = "2022",
publisher = "Association for Computational Linguistics"
}

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

Multilingual Emotion classification using BERT (fine-tuning). Published at the WASSA workshop (ACL2022).

Related tags

Overview

XLM-EMO: Multilingual Emotion Prediction in Social Media Text

Abstract

Installing

Features

Models

Reference

Credits

Owner

MilaNLP

Text to speech converter with GUI made in Python.

Behavioral Testing of Clinical NLP Models

This is a general repo that helps you develop fast/effective NLP classifiers using Huggingface

NLP Text Classification

Sample data associated with the Aurora-BP study

构建一个多源（公众号、RSS）、干净、个性化的阅读环境

Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃

Speech to text streamlit app

Python utility library for compositing PDF documents with reportlab.

⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).

NeuTex: Neural Texture Mapping for Volumetric Neural Rendering

The model is designed to train a single and large neural network in order to predict correct translation by reading the given sentence.

Count the frequency of letters or words in a text file and show a graph.

The implementation of Parameter Differentiation based Multilingual Neural Machine Translation

This project uses word frequency and Term Frequency-Inverse Document Frequency to summarize a text.

Translation to python of Chris Sims' optimization function

Multilingual finetuning of Machine Translation model on low-resource languages. Project for Deep Natural Language Processing course.

This project converts your human voice input to its text transcript and to an automated voice too.

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS)

A Japanese tokenizer based on recurrent neural networks