A Structured Self-attentive Sentence Embedding

Last update: Nov 28, 2022

Overview

Structured Self-attentive sentence embeddings

Implementation of the paper A Structured Self-Attentive Sentence Embedding, published at ICLR 2017: https://arxiv.org/abs/1703.03130

USAGE:

For binary sentiment classification on the IMDB dataset, run: python classification.py "binary"

For multiclass classification on the Reuters dataset, run: python classification.py "multiclass"

You can change the model parameters in the model_params.json file. Other training parameters, such as the number of attention hops, can be configured in the config.json file.

To use pretrained GloVe embeddings, set the use_embeddings parameter to "True" (the default is False). Do not forget to download glove.6B.50d.txt and place it in the glove folder.
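For orientation, here is a minimal sketch of how a GloVe text file can be parsed. It is not the repository's own loader; the function name load_glove and its dim parameter are made up for illustration. It relies only on the GloVe text format, where each line holds a word followed by its space-separated vector components.

```python
def load_glove(lines, dim=50):
    """Parse GloVe text lines into a {word: vector} dict (hypothetical helper)."""
    vectors = {}
    for line in lines:
        parts = line.rstrip().split(" ")
        if len(parts) != dim + 1:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


# Tiny in-memory example with 3-dimensional vectors instead of 50
sample = ["the 0.1 0.2 0.3", "cat -0.5 0.0 0.25"]
glove = load_glove(sample, dim=3)
```

With the real file, the same function could be fed an open file handle, for example open("glove/glove.6B.50d.txt", encoding="utf-8"), keeping the default dim=50.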

Implemented:

Classification using self attention
Regularization using Frobenius norm
Gradient clipping
Visualizing the attention weights
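The Frobenius norm regularization in the paper penalizes redundancy between attention hops with the term ||A A^T - I||_F^2, where A is the attention matrix with one row per hop. The following is a plain-Python sketch of that penalty for a small matrix given as a list of rows. It is meant as an illustration only and is not the code used in this repository; the name frobenius_penalty is an assumption.

```python
def frobenius_penalty(A):
    """Squared Frobenius norm of (A A^T - I) for an r x n matrix A (list of rows)."""
    r = len(A)
    total = 0.0
    for i in range(r):
        for j in range(r):
            # (A A^T)[i][j] is the dot product of hop i and hop j
            dot = sum(a * b for a, b in zip(A[i], A[j]))
            diff = dot - (1.0 if i == j else 0.0)
            total += diff * diff
    return total


# Two hops attending to different tokens: no penalty
distinct = frobenius_penalty([[1.0, 0.0], [0.0, 1.0]])
# Two hops attending to the same token: penalized
redundant = frobenius_penalty([[1.0, 0.0], [1.0, 0.0]])
```

Hops that focus on different tokens produce a small penalty, while hops that duplicate each other produce a larger one, which is what pushes the hops toward attending to different parts of the sentence.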

Instead of pruning, averaging over the sentence embeddings is used.
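One plausible reading of this step, sketched below under that assumption: the sentence embedding matrix M = A H has one row per attention hop, and the rows are averaged into a single vector before classification. The helper names sentence_matrix and average_hops are hypothetical, and the repository's actual tensor code may differ.

```python
def sentence_matrix(A, H):
    """M = A H, with A of shape r x n (hops x tokens) and H of shape n x d."""
    return [
        [sum(a * h for a, h in zip(row, col)) for col in zip(*H)]
        for row in A
    ]


def average_hops(M):
    """Average the r hop embeddings (rows of M) into one d-dimensional vector."""
    r = len(M)
    return [sum(col) / r for col in zip(*M)]


A = [[1.0, 0.0], [0.5, 0.5]]   # 2 hops over 2 tokens
H = [[2.0, 4.0], [6.0, 8.0]]   # 2 tokens with 2-dimensional hidden states
M = sentence_matrix(A, H)
embedding = average_hops(M)
```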

Visualization:

After training, the model is tested on 100 test points. The attention weights for these 100 test points are retrieved and rendered over the text as heatmaps. A file named visualization.html is saved in the visualization/ folder after successful training. The visualization code was provided by Zhouhan Lin (@hantek). Many thanks.
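To illustrate the idea of a text heatmap, here is a minimal sketch that colors each token by its attention weight. It is not the visualization code by Zhouhan Lin that this repository uses; the function name attention_to_html and the red color scheme are assumptions chosen for the example.

```python
import html


def attention_to_html(tokens, weights):
    """Return HTML where each token's background opacity follows its weight."""
    peak = max(weights) or 1.0  # avoid division by zero if all weights are 0
    spans = []
    for tok, w in zip(tokens, weights):
        alpha = w / peak  # scale so the most attended token is fully opaque
        spans.append(
            '<span style="background-color: rgba(255, 0, 0, {:.2f})">{}</span>'.format(
                alpha, html.escape(tok)
            )
        )
    return " ".join(spans)


snippet = attention_to_html(["good", "<br>"], [0.2, 0.8])
```

Writing such snippets for each test sentence into a single HTML file would give a page similar in spirit to visualization.html.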

Below is a screenshot of the visualization on a few data points.

Training accuracy: 93.4%. Tested on 1000 points with 90.2% accuracy.

Owner

Kaushal Shetty
