I label phrases on a scale of five values: negative, somewhat negative, neutral, somewhat positive, positive

Last update: Jan 13, 2022

Overview

Sentiment-of-movie-reviews

I label phrases on a scale of five values: negative, somewhat negative, neutral, somewhat positive, positive. Obstacles like sentence negation, sarcasm, terseness, language ambiguity, and many others make this task very challenging.

This project uses datasets available on kaggle for training and testing.

Transformers brings all these models together and makes it very easy to use each with only a few lines of code. In fact they even provide us with cool tools like pipelines or live demo that we can classify our text without any training or long periods of coding. But as you can geuss these simple and ready to use models have their weaknesses. For example, you can't classify the text with them with the number of labels you want because they've been pretrained on a text with specific labels. Also not all models used by them are as strong and accurate as we want them to be(for example the default model for sentiment analysis is uncased distillbert which is not the best model we can find out there). With all these in mind, we want to train .Transformers models on our own data with the models that we prefer.

I label phrases on a scale of five values: negative, somewhat negative, neutral, somewhat positive, positive

Related tags

Overview

Sentiment-of-movie-reviews

Owner

Text Classification Using LSTM

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

The model is designed to train a single and large neural network in order to predict correct translation by reading the given sentence.

Grover is a model for Neural Fake News -- both generation and detectio

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型，适用于英语、普通话/中文、日语、韩语、俄语和藏语（当前已测试）。

Pipeline for training LSA models using Scikit-Learn.

Chinese Pre-Trained Language Models (CPM-LM) Version-I

ByT5: Towards a token-free future with pre-trained byte-to-byte models

COVID-19 Chatbot with Rasa 2.0: open source conversational AI

A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP

Code-autocomplete, a code completion plugin for Python

Weaviate demo with the text2vec-openai module

초성 해석기 based on ko-BART

This is Assignment1 code for the Web Data Processing System.

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Kerberoast with ACL abuse capabilities

Nystromformer: A Nystrom-based Algorithm for Approximating Self-Attention

Code release for "COTR: Correspondence Transformer for Matching Across Images"

Open Source Neural Machine Translation in PyTorch