Measuring if attention is explanation with ROAR

Last update: Nov 13, 2022

Related tags

Overview

NLP ROAR Interpretability

Official code for: Evaluating the Faithfulness of Importance Measures in NLP by Recursively Masking Allegedly Important Tokens and Retraining

Install

git clone https://github.com/AndreasMadsen/nlp-roar-interpretability.git
cd nlp-roar-interpretability
python -m pip install -e .

Experiments

Tasks

There are scripts for each dataset. Note that some tasks share a dataset. Use this list to identify how to train a model for each task.

SST: python experiments/stanford_sentiment.py
SNLI: python experiments/stanford_nli.py
IMDB: python experiments/imdb.py
MIMIC (Diabetes): python experiments/mimic.py --subset diabetes
MIMIC (Anemia): python experiments/mimic.py --subset anemia
bABI-1: python experiments/babi.py --task 1
bABI-2: python experiments/babi.py --task 2
bABI-3: python experiments/babi.py --task 3

Parameters

Each of the above scripts stanford_sentiment, stanford_nli, imdb, mimic, and babi take the same set of CLI arguments. You can learn about each argument with --help. The most important arguments which will allow you to run the experiments presented in the paper are:

--importance-measure: this specifies which importance measure is used. It can be either random, mutual-information, attention , gradient, or integrated-gradient.
--seed: specifies the seed used to initialize the model.
--roar-strategy: should ROAR masking be done absoloute (count) or relative (quantile),
--k: the proportion of tokens in % to mask if --roar-strategy quantile is used. The number of tokens if --roar-strategy count is used.
--recursive: indicates that model to use for computing the importance measure has --k set to --k - --recursive-step-size instead of 0 as used in classic ROAR.

Note, for --k > 0, the reference model must already be trained. For example, in the non-recursive case, this means that a model trained with --k 0 must already available.

Running on a HPC setup

For downloading dataset dependencies we provide a download.sh script.

Additionally, we provide script for submitting all jobs to a Slurm queue, in batch_jobs/. Note again, that the ROAR script assume there are checkpoints for the baseline --k 0 models.

The jobs automatically use $SCRATCH/nlproar as the presistent dir.

MIMIC

See https://mimic.physionet.org/gettingstarted/access/ for how to access MIMIC. You will need to download DIAGNOSES_ICD.csv.gz and NOTEEVENTS.csv.gz and place them in mimic/ relative to your presistent dir.

Measuring if attention is explanation with ROAR

Related tags

Overview

NLP ROAR Interpretability

Install

Experiments

Tasks

Parameters

Running on a HPC setup

MIMIC

Owner

Andreas Madsen

Text to Image Generation with Semantic-Spatial Aware GAN

A map update dataset and benchmark

Infrastructure as Code (IaC) for a self-hosted version of Gnosis Safe on AWS

This repository implements variational graph auto encoder by Thomas Kipf.

This is the face keypoint train code of project face-detection-project

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder,

YOLOX + ROS(1, 2) object detection package

[NeurIPS-2020] Self-paced Contrastive Learning with Hybrid Memory for Domain Adaptive Object Re-ID.

A model to classify a piece of news as REAL or FAKE

Code for the paper "SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness" (NeurIPS 2021)

The hippynn python package - a modular library for atomistic machine learning with pytorch.

A Weakly Supervised Amodal Segmenter with Boundary Uncertainty Estimation

GRF: Learning a General Radiance Field for 3D Representation and Rendering

这是一个yolox-pytorch的源码，可以用于训练自己的模型。

This is a TensorFlow implementation for C2-Rec

10x faster matrix and vector operations

tf2-keras implement yolov5

The Python3 import playground

Betafold - AlphaFold with tunings

Pyramid Scene Parsing Network, CVPR2017.

Measuring if attention is explanation with ROAR

Related tags

Overview

NLP ROAR Interpretability

Install

Experiments

Tasks

Parameters

Running on a HPC setup

MIMIC

Owner

Andreas Madsen

Text to Image Generation with Semantic-Spatial Aware GAN

A map update dataset and benchmark

Infrastructure as Code (IaC) for a self-hosted version of Gnosis Safe on AWS

This repository implements variational graph auto encoder by Thomas Kipf.

This is the face keypoint train code of project face-detection-project

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder,

YOLOX + ROS(1, 2) object detection package

[NeurIPS-2020] Self-paced Contrastive Learning with Hybrid Memory for Domain Adaptive Object Re-ID.

A model to classify a piece of news as REAL or FAKE

Code for the paper "SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness" (NeurIPS 2021)

The hippynn python package - a modular library for atomistic machine learning with pytorch.

A Weakly Supervised Amodal Segmenter with Boundary Uncertainty Estimation

GRF: Learning a General Radiance Field for 3D Representation and Rendering

这是一个yolox-pytorch的源码，可以用于训练自己的模型。

This is a TensorFlow implementation for C2-Rec

10x faster matrix and vector operations

tf2-keras implement yolov5

The Python3 import playground

Betafold - AlphaFold with tunings

Pyramid Scene Parsing Network, CVPR2017.

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder,