A combination of autoregressors and autoencoders using XLNet for sentiment analysis

Last update: Nov 20, 2021

Overview

Abstract

In this paper, sentiment analysis is performed in order to evaluate the performance of XLNet on this particular task. XLNet is a ground-breaking language understanding network that draws on the strengths of both autoregressive models and autoencoders. While BERT relies on autoencoding and conventional Transformer language models rely on autoregression, XLNet combines attributes of both in order to achieve higher performance in many NLP tasks, such as sentiment analysis, question answering, reading comprehension and natural language understanding. In this work we evaluate the XLNet model on several sentiment classification tasks in terms of accuracy and efficiency. XLNet reaches state-of-the-art results and outperforms BERT, the previous state-of-the-art model in natural language processing.
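One way to picture how XLNet combines the two approaches is the permutation language modeling idea from the XLNet paper: tokens are still predicted one at a time (autoregressive), but in a randomly sampled order, so each token can end up seeing context on both sides (as in an autoencoder). The sketch below is a toy illustration in plain Python and is not the code used in this project; the function name `permutation_mask` and the example sentence are hypothetical.

```python
import random


def permutation_mask(seq_len, rng):
    """Sample a factorization order and build the matching visibility mask.

    mask[i][j] is True when position i may use position j as context,
    i.e. when j is predicted earlier than i in the sampled order.
    """
    order = list(range(seq_len))
    rng.shuffle(order)
    # rank[pos] = step at which position `pos` is predicted
    rank = {pos: step for step, pos in enumerate(order)}
    mask = [[rank[j] < rank[i] for j in range(seq_len)] for i in range(seq_len)]
    return order, mask


# Hypothetical example sentence, used only for illustration.
tokens = ["the", "movie", "was", "great"]
order, mask = permutation_mask(len(tokens), random.Random(0))

for i, tok in enumerate(tokens):
    context = [tokens[j] for j in range(len(tokens)) if mask[i][j]]
    print(f"predict {tok!r} from {context}")
```

Depending on the sampled order, a token may be predicted from words to its left, to its right, or both, which is what lets an autoregressive objective capture bidirectional context.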

This was an assignment for the Deep Learning course in the PhD program of the National Technical University of Athens.

Team of 3 people
Runs were made on HPC-ARIS through batch scripts
Course grade 10/10 (excellent)
Full report, formatted as a paper, here
Code for 2 of the 3 sentiment analysis tasks (implemented by the author of this repo) here
Data available here

Owner

James Zaridis
