A model which classifies reviews as positive or negative.

Last update: Feb 09, 2022

Overview

Sentiment Analysis

In this project I built a model to classify movie reviews from the IMDB dataset of 50K reviews.

Word2Vec:

Neural networks only work with numeric (floating-point) values, so how do they handle sequences of words? The initial solution was to convert each word to a one-hot vector. This works, but it does not allow the model to generalize at all. For example, a model could complete "I want some orange _______" with "juice" because it had seen "orange" in that context, but the same model fails when the sentence contains "apple" instead: one-hot vectors carry no notion of similarity between words, so nothing learned about "orange" transfers to "apple".

The above image shows one-hot representations of words in the vocabulary.
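The limitation can be seen in a minimal sketch. The vocabulary and helper names below are made up for illustration and are not taken from the project code; the point is only that any two distinct one-hot vectors have a dot product of zero.

```python
# Toy vocabulary chosen only for illustration (an assumption, not the IMDB vocab).
vocab = ["i", "want", "some", "orange", "apple", "juice"]
word_to_index = {word: i for i, word in enumerate(vocab)}


def one_hot(word):
    """Return a list with a single 1.0 at the word's vocabulary index."""
    vector = [0.0] * len(vocab)
    vector[word_to_index[word]] = 1.0
    return vector


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


orange, apple, juice = one_hot("orange"), one_hot("apple"), one_hot("juice")

# Distinct words always have dot product 0, so "orange" looks exactly as
# unrelated to "apple" as it does to "juice".
print(dot(orange, apple))   # 0.0
print(dot(orange, juice))   # 0.0
print(dot(orange, orange))  # 1.0
```

Each vector is also as long as the vocabulary, which becomes wasteful once the vocabulary holds tens of thousands of words.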

So how do we solve this problem?

Enter word representations, where each word is represented by an m-dimensional vector (much smaller than the vocabulary size). We learn these embeddings for the words in the training set. In the past this was done with the computationally expensive skip-gram model and its improved variants; today we also have Transformers (BERT). Word2Vec is still relevant today.

The figure above shows the vector representations of words (their embeddings) and where they sit in vector space.
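As a rough sketch of why dense vectors help, the snippet below compares words by cosine similarity. The three-dimensional vectors are hand-picked, hypothetical values, not learned embeddings; real embeddings would come from training or from a pre-trained model.

```python
import math

# Hypothetical 3-dimensional embeddings, hand-picked for illustration only.
embeddings = {
    "orange": [0.9, 0.8, 0.1],
    "apple": [0.8, 0.9, 0.2],
    "king": [0.1, 0.2, 0.9],
}


def cosine_similarity(u, v):
    """Cosine of the angle between u and v (1.0 means same direction)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


fruit_similarity = cosine_similarity(embeddings["orange"], embeddings["apple"])
other_similarity = cosine_similarity(embeddings["orange"], embeddings["king"])

# With these toy values the two fruits are closer to each other than
# "orange" is to "king".
print(fruit_similarity > other_similarity)  # True
```

Because similar words end up with similar vectors, what a model learns about "orange" can carry over to "apple", which one-hot vectors cannot offer.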

In this project I have used both pre-trained word embeddings and ones that I trained from scratch. Many NLP libraries provide pre-trained embeddings, such as spaCy and fastText. I used fastText embeddings in the LSTM model.

Architectures used:

RNN Architecture

The first model I built used an RNN architecture with an embedding matrix trained from scratch. At each time step an RNN block takes the current element of the sequence together with the activations from the previous time step as input, and each block has two outputs: the activations and ŷ (the prediction).
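The recurrence can be sketched in plain Python. This is an illustration rather than the project's implementation: the parameter names (W_ax, W_aa, w_ya), their values, and the tiny two-dimensional inputs are all assumptions, and the prediction is taken to be a sigmoid over the activations.

```python
import math


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def rnn_step(x_t, a_prev, params):
    """One recurrent block: returns the new activations and a prediction."""
    W_ax, W_aa, b_a, w_ya, b_y = params
    pre_activation = [
        p + q + b
        for p, q, b in zip(matvec(W_ax, x_t), matvec(W_aa, a_prev), b_a)
    ]
    a_t = [math.tanh(z) for z in pre_activation]
    logit = sum(w * a for w, a in zip(w_ya, a_t)) + b_y
    y_hat = 1.0 / (1.0 + math.exp(-logit))  # sigmoid, assumed for binary output
    return a_t, y_hat


# Hypothetical fixed parameters; in a real model these are learned.
params = (
    [[0.5, -0.3], [0.1, 0.8]],  # W_ax: input -> hidden
    [[0.2, 0.0], [0.0, 0.2]],   # W_aa: hidden -> hidden
    [0.0, 0.0],                 # b_a
    [1.0, -1.0],                # w_ya: hidden -> output
    0.0,                        # b_y
)

# Stand-in for a sequence of three word vectors.
sequence = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

a_t = [0.0, 0.0]  # initial activations
for x_t in sequence:
    # The same params are reused at every time step.
    a_t, y_hat = rnn_step(x_t, a_t, params)

print(0.0 < y_hat < 1.0)  # True
```

For review classification only the prediction from the final time step would normally be used, since it has seen the whole sequence.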

LSTM Network (Long Short-Term Memory Network):

The second model for classification was a BiLSTM (bidirectional LSTM), which mitigates the vanishing gradient problem commonly seen in RNNs (this problem shows up as the network forgetting information from earlier in the sequence). Here pre-trained word embeddings were used.
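A BiLSTM classifier along these lines might look like the PyTorch sketch below. It is not the project's actual model: the class name, hidden size, and single output logit are assumptions, and a random tensor stands in for the real fastText vectors.

```python
import torch
import torch.nn as nn


class BiLSTMClassifier(nn.Module):
    """Hypothetical sentiment classifier: embeddings -> BiLSTM -> linear."""

    def __init__(self, pretrained_vectors, hidden_size=64):
        super().__init__()
        # freeze=True keeps the pre-trained vectors fixed during training.
        self.embedding = nn.Embedding.from_pretrained(
            pretrained_vectors, freeze=True
        )
        self.lstm = nn.LSTM(
            input_size=pretrained_vectors.size(1),
            hidden_size=hidden_size,
            batch_first=True,
            bidirectional=True,
        )
        # Forward and backward final states are concatenated.
        self.classifier = nn.Linear(2 * hidden_size, 1)

    def forward(self, token_ids):
        embedded = self.embedding(token_ids)        # (batch, seq, dim)
        _, (h_n, _) = self.lstm(embedded)           # h_n: (2, batch, hidden)
        final = torch.cat([h_n[0], h_n[1]], dim=1)  # (batch, 2 * hidden)
        return self.classifier(final).squeeze(1)    # (batch,) logits


# Random stand-in for a (vocab_size, dim) matrix of pre-trained vectors.
vectors = torch.randn(100, 50)
model = BiLSTMClassifier(vectors)

# A batch of 4 reviews, each 12 token ids long.
batch = torch.randint(0, 100, (4, 12))
logits = model(batch)
print(logits.shape)  # torch.Size([4])
```

The logits could be trained with `nn.BCEWithLogitsLoss` against 0/1 sentiment labels; applying a sigmoid to a logit gives the probability that a review is positive.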

Note: The parameters for each block are the same. We use the same block for all words in RNNs and related architectures.

References

PyTorch Documentation

Word2Vec Paper

Colah's Blog on LSTMs

Owner

Rishabh Bali
