AI-UPV at IberLEF-2021 EXIST task: Sexism Prediction in Spanish and English Tweets Using Monolingual and Multilingual BERT and Ensemble Models

Last update: Jun 08, 2022

Description

This repository contains the code for the paper "Sexism Prediction in Spanish and English Tweets Using Monolingual and Multilingual BERT and Ensemble Models". The paper will be published at SEPLN-WS-IberLEF 2021 (the 3rd Workshop on the Iberian Languages Evaluation Forum at the SEPLN 2021 Conference). The implementation and the dataset are described in the paper (paper link coming soon).

Paper Abstract

The popularity of social media has created problems such as hate speech and sexism. Identifying and classifying sexism in social media are highly relevant tasks, as they would help build a healthier social environment. Nevertheless, these tasks are considerably challenging. This work proposes a system that uses multilingual and monolingual BERT models, data point translation, and ensemble strategies for sexism identification and classification in English and Spanish. It was conducted in the context of the sEXism Identification in Social neTworks 2021 shared task (EXIST 2021), proposed by the Iberian Languages Evaluation Forum (IberLEF). The proposed system and its main components are described, and an in-depth hyperparameter analysis is conducted. The main results observed were: (i) the system obtained better results than the baseline model (multilingual BERT); (ii) ensemble models obtained better results than monolingual models; and (iii) the E6 model (the ensemble model considering all individual models and the best standardized values) obtained the best accuracies and F1-scores for both tasks. This work obtained first place in both tasks at EXIST, with the highest accuracies (0.780 for task 1 and 0.658 for task 2) and F1-scores (F1-binary of 0.780 for task 1 and F1-macro of 0.579 for task 2).
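To illustrate what an ensemble strategy can look like, here is a minimal sketch of soft voting, a generic technique that averages the class probabilities of several models and picks the most likely class. It is not necessarily how the E6 model in the paper combines its members. The label names, the three models, and all probability values below are made up for the example.

```python
# Illustrative soft-voting ensemble. This is a generic technique and an
# assumption for this sketch, not the paper's confirmed ensembling rule.

# Hypothetical label set for the binary task (task 1).
LABELS = ["non-sexist", "sexist"]


def soft_vote(per_model_probs):
    """Average class probabilities across models for a single tweet.

    per_model_probs holds one row per model; each row has one
    probability per class, in the same order as LABELS.
    """
    n_models = len(per_model_probs)
    # zip(*rows) regroups the values class by class.
    return [sum(class_probs) / n_models for class_probs in zip(*per_model_probs)]


def predict_label(per_model_probs, labels=LABELS):
    """Return the label with the highest averaged probability."""
    averaged_probs = soft_vote(per_model_probs)
    best_index = max(range(len(labels)), key=lambda i: averaged_probs[i])
    return labels[best_index]


# Made-up probabilities from three hypothetical fine-tuned models.
example_probs = [
    [0.30, 0.70],
    [0.55, 0.45],
    [0.40, 0.60],
]

averaged = soft_vote(example_probs)
predicted = predict_label(example_probs)
print(predicted)
```

In this example the second model leans towards "non-sexist", but the averaged probabilities still favour "sexist", which is the kind of disagreement an ensemble is meant to smooth out.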

Credits

EXIST Shared Task Organizers

Task website: http://nlp.uned.es/exist2021/

Contact: [email protected]

Owner

Angel de Paula
