A Structured Self-attentive Sentence Embedding

Last update: Nov 28, 2022

Overview

Structured Self-attentive sentence embeddings

Implementation of the paper A Structured Self-Attentive Sentence Embedding, published at ICLR 2017: https://arxiv.org/abs/1703.03130

USAGE:

For binary sentiment classification on the IMDB dataset, run: python classification.py "binary"

For multiclass classification on the Reuters dataset, run: python classification.py "multiclass"

You can change the model parameters in the model_params.json file. Other training parameters, such as the number of attention hops, can be configured in the config.json file.

To use pretrained GloVe embeddings, set the use_embeddings parameter to "True" (the default is False). Remember to download glove.6B.50d.txt and place it in the glove folder.
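As a rough illustration of what loading the GloVe file involves, here is a minimal sketch, not the repository's own loader. It assumes the standard GloVe text layout (one word per line followed by its space-separated vector components). The names load_glove, build_embedding_matrix and word_index are hypothetical, and the demo uses a tiny 3-dimensional in-memory sample instead of the real 50-dimensional file.

```python
import io


def load_glove(lines, dim=50):
    """Parse GloVe-style text lines into a {word: vector} dict."""
    vectors = {}
    for line in lines:
        parts = line.rstrip().split(" ")
        if len(parts) != dim + 1:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def build_embedding_matrix(word_index, vectors, dim=50):
    """Row i holds the vector of the word with index i; unknown words stay zero."""
    matrix = [[0.0] * dim for _ in range(max(word_index.values()) + 1)]
    for word, idx in word_index.items():
        if word in vectors:
            matrix[idx] = vectors[word]
    return matrix


# Tiny in-memory stand-in for the GloVe file (3 dimensions for brevity).
sample = io.StringIO("the 0.1 0.2 0.3\nmovie 0.4 0.5 0.6\n")
glove = load_glove(sample, dim=3)

# word_index is a hypothetical vocabulary mapping; index 0 is left for padding.
emb = build_embedding_matrix({"the": 1, "movie": 2, "zzz": 3}, glove, dim=3)
```

With the real file, the same function could be called as load_glove(open("glove/glove.6B.50d.txt", encoding="utf-8")), assuming the file sits in the glove folder as described above.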

Implemented:

Classification using self attention
Regularization using Frobenius norm
Gradient clipping
Visualizing the attention weights
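The first three items can be sketched in a few lines of NumPy. This is an illustration under stated assumptions, not the code in this repository. It follows the paper's notation: H is the (n, 2u) matrix of hidden states, A = softmax(W_s2 tanh(W_s1 H^T)) is the (r, n) attention matrix for r hops, M = A H is the sentence embedding matrix, and the penalty is ||A A^T - I||_F^2. The function names, the toy dimensions and the clipping threshold are all hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def self_attention(H, W_s1, W_s2):
    """H: (n, 2u) hidden states. Returns A: (r, n) and M: (r, 2u)."""
    # Each row of A is one attention hop and sums to 1 over the n tokens.
    A = softmax(W_s2 @ np.tanh(W_s1 @ H.T), axis=1)
    M = A @ H
    return A, M


def frobenius_penalty(A):
    """Squared Frobenius norm of (A A^T - I); pushes hops to attend to different tokens."""
    diff = A @ A.T - np.eye(A.shape[0])
    return float(np.sum(diff ** 2))


def clip_by_global_norm(grads, max_norm):
    """Rescale all gradients so their combined L2 norm is at most max_norm."""
    total = float(np.sqrt(sum(np.sum(g ** 2) for g in grads)))
    scale = min(1.0, max_norm / (total + 1e-6))
    return [g * scale for g in grads], total


# Toy dimensions (assumed for the demo): 6 tokens, 2u = 8, d_a = 5, r = 3 hops.
rng = np.random.default_rng(0)
n, two_u, d_a, r = 6, 8, 5, 3
H = rng.normal(size=(n, two_u))
W_s1 = rng.normal(size=(d_a, two_u))
W_s2 = rng.normal(size=(r, d_a))

A, M = self_attention(H, W_s1, W_s2)
penalty = frobenius_penalty(A)

# A stand-in gradient with norm 6.0, clipped to an assumed threshold of 0.5.
clipped, norm_before = clip_by_global_norm([np.full((2, 2), 3.0)], max_norm=0.5)
```

In training, the penalty would typically be multiplied by a coefficient and added to the classification loss, and clipping would be applied to the parameter gradients before each optimizer step.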

Instead of pruning, this implementation averages over the sentence embeddings.
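One plausible reading of this averaging step, sketched below, is that the r rows of the embedding matrix M (one per attention hop) are averaged into a single vector before the classifier, which keeps the classifier's input size independent of r. This is an assumption about the intent, not a copy of the repository code; average_hops is a hypothetical name.

```python
import numpy as np


def average_hops(M):
    """M: (batch, r, 2u) sentence embedding matrices -> (batch, 2u) averaged vectors."""
    return M.sum(axis=1) / M.shape[1]


# One sentence, r = 3 hops, 2u = 4.
M = np.arange(12, dtype=float).reshape(1, 3, 4)
avg = average_hops(M)
```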

Visualization:

After training, the model is tested on 100 test points. The attention weights for these 100 test points are retrieved and rendered as heatmaps over the text. A file visualization.html is saved in the visualization/ folder after successful training. The visualization code was provided by Zhouhan Lin (@hantek). Many thanks.
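To give a feel for how attention weights can be turned into an HTML heatmap, here is a minimal sketch. It is not Zhouhan Lin's visualization code. It assumes one weight per token and maps each weight to the opacity of a red background; render_heatmap and render_page are hypothetical names.

```python
import html


def render_heatmap(tokens, weights):
    """Return one HTML line where each token's background opacity tracks its weight."""
    peak = max(weights) or 1.0  # avoid dividing by zero when all weights are 0
    spans = []
    for tok, w in zip(tokens, weights):
        alpha = w / peak  # the most attended token gets full opacity
        spans.append(
            '<span style="background-color: rgba(255,0,0,%.2f)">%s</span>'
            % (alpha, html.escape(tok))
        )
    return " ".join(spans)


def render_page(examples):
    """examples: list of (tokens, weights) pairs, one per test sentence."""
    rows = ["<p>%s</p>" % render_heatmap(t, w) for t, w in examples]
    return "<html><body>\n%s\n</body></html>" % "\n".join(rows)


page = render_page([(["a", "great", "<movie>"], [0.1, 0.7, 0.2])])
```

The resulting string could then be written to a file such as visualization/visualization.html and opened in a browser.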

Below is a screenshot of the visualization on a few data points.

Training accuracy: 93.4%. Tested on 1000 points with 90.2% accuracy.

Owner

Kaushal Shetty

deep-prae
