Anuvada: Interpretable Models for NLP using PyTorch

So, you want to know why your classifier arrived at a particular decision, or why your flashy new deep learning classification model is not performing the way you want it to? Or perhaps your dataset is biased towards a particular class and you want to find out whether such edge cases exist.

One of the common criticisms of deep learning has been its black-box nature (life itself is a big black box, not at all interpretable, don't even ask me about love). To address this issue, researchers have developed many ways to visualise and explain a model's inference. Not every model has to be explainable, but when important decisions are being made, such as which jobs to recommend to a person or whether to give a person a loan, it helps to be able to cross-check the model's claims. In such domains, self-explainable models are necessary.

This library is an ongoing effort to provide high-level access to such models, built on top of PyTorch.

Here is what you can expect to visualize from a trained model.

Note: This model is a convolutional neural network trained on the IMDB sentiment analysis dataset. I trained the model using SGD until the validation loss stopped improving. Here is the sensitivity analysis on some sample inputs. You can find more details about training the model in the Jupyter notebooks in the examples directory.
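For readers who want a feel for what sensitivity analysis computes, here is a minimal sketch in plain PyTorch. It is not this library's API: `TinyTextCNN`, `forward_from_embeddings`, and `token_sensitivity` are hypothetical names made up for illustration, the model is untrained, and the token ids are arbitrary. The idea is to take the gradient of a class score with respect to each token's embedding and use its norm as a per-token importance score.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class TinyTextCNN(nn.Module):
    """Hypothetical stand-in for a trained CNN sentiment classifier."""

    def __init__(self, vocab_size=50, embed_dim=8, n_filters=4, n_classes=2):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        self.conv = nn.Conv1d(embed_dim, n_filters, kernel_size=3, padding=1)
        self.fc = nn.Linear(n_filters, n_classes)

    def forward_from_embeddings(self, embedded):
        # embedded: (batch, seq_len, embed_dim); Conv1d expects (batch, channels, seq_len)
        features = F.relu(self.conv(embedded.transpose(1, 2)))
        pooled = features.max(dim=2).values  # max-over-time pooling
        return self.fc(pooled)

    def forward(self, token_ids):
        return self.forward_from_embeddings(self.embedding(token_ids))


def token_sensitivity(model, token_ids, target_class):
    """Score each token by the L2 norm of the gradient of the
    target-class logit with respect to that token's embedding."""
    model.eval()
    # Detach so the embedded input becomes a leaf tensor we can differentiate against.
    embedded = model.embedding(token_ids).detach().requires_grad_(True)
    logits = model.forward_from_embeddings(embedded)
    logits[0, target_class].backward()
    return embedded.grad.norm(dim=2).squeeze(0)


torch.manual_seed(0)
model = TinyTextCNN()
token_ids = torch.tensor([[3, 17, 42, 8, 25]])  # made-up token ids for one review
scores = token_sensitivity(model, token_ids, target_class=1)
print(scores)  # one non-negative score per token
```

With a trained model, larger scores would mark the words whose embeddings most affect the predicted class, which is the kind of signal used to highlight words in a review.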

Positive review

Negative review

Installing

Clone this repo and add it to your Python library path.
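One way to do the second step from inside a script or notebook is sketched below. The helper name `add_repo_to_path` and the clone location `~/code/anuvada` are assumptions for illustration only; use whatever directory you cloned into. Setting the `PYTHONPATH` environment variable to the same directory should work as well.

```python
import sys
from pathlib import Path


def add_repo_to_path(repo_dir):
    """Prepend a local clone to sys.path so its packages become importable."""
    resolved = str(Path(repo_dir).expanduser().resolve())
    if resolved not in sys.path:
        sys.path.insert(0, resolved)
    return resolved


# Hypothetical clone location; replace with wherever you cloned the repo.
repo_path = add_repo_to_path("~/code/anuvada")
```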

Requirements

PyTorch
NumPy
Pandas
spaCy
Gensim
tqdm

To do list

Acknowledgments

https://github.com/henryre/pytorch-fitmodule


Owner

EDGE
