Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Last update: Oct 27, 2022

Related tags

Overview

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Overview of paths used in DIG and IG. w is the word being attributed. The gray region is the neighborhood of w. Green line depicts the straight-line path from w to w' used by IG and the green squares are the corresponding interpolation points. Left: In DIG-Greedy, we first monotonize each word in the neighborhood (red arrow). Then the word closest to its corresponding monotonic point is selected as the anchor (blue line to w_5 since the red arrow of w_5 has the shortest magnitude). Right: In DIG-MaxCount we first count the number of monotonic dimensions for each word in the neighborhood (shown in [.] above). Then, the word with the highest number of monotonic dimensions is selected as the anchor word (blue line to w_4), followed by changing the non-monotonic dimensions of w_4 (red line to c). Repeating this step gives the zigzag blue path. Finally, the red stars are the interpolated points used by our method. Please refer to the paper for more details.

Dependencies

Dependencies can be installed using requirements.txt.

Evaluating DIG:

Install all the requirements from requirements.txt.
Execute ./setup.sh for setting up the folder hierarchy for experiments.

Commands for reproducing the reported results on DistilBERT fine-tuned on SST2:

# Generate the KNN graph
python knn.py -dataset sst2 -nn distilbert

# DIG (strategy: Greedy)
python main.py -dataset sst2 -nn distilbert -strategy greedy

# DIG (strategy: MaxCount)
python main.py -dataset sst2 -nn distilbert -strategy maxcount

Similarly, commands can be changed for other settings.

Please contact Soumya for any clarifications or suggestions.

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Related tags

Overview

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Dependencies

Evaluating DIG:

Owner

INK Lab @ USC

Highway networks implemented in PyTorch.

Select, weight and analyze complex sample data

Publication describing 3 ML examples at NSLS-II and interfacing into Bluesky

The authors' official PyTorch SigWGAN implementation

Official PyTorch Implementation of paper "NeLF: Neural Light-transport Field for Single Portrait View Synthesis and Relighting", EGSR 2021.

Gluon CV Toolkit

DeepAL: Deep Active Learning in Python

Simple cross-platform application for DaVinci surgical video frame annotation

Neural Scene Flow Fields using pytorch-lightning, with potential improvements

Fewshot-face-translation-GAN - Generative adversarial networks integrating modules from FUNIT and SPADE for face-swapping.

MegEngine implementation of YOLOX

Unsupervised Feature Ranking via Attribute Networks.

Code for Domain Adaptive Video Segmentation via Temporal Consistency Regularization in ICCV 2021

MVFNet: Multi-View Fusion Network for Efficient Video Recognition (AAAI 2021)

Our CIKM21 Paper "Incorporating Query Reformulating Behavior into Web Search Evaluation"

Pre-Training Graph Neural Networks for Cold-Start Users and Items Representation.

10th place solution for Google Smartphone Decimeter Challenge at kaggle.

PyTorch implementation of Histogram Layers from DeepHist: Differentiable Joint and Color Histogram Layers for Image-to-Image Translation

Official PyTorch Implementation for InfoSwap: Information Bottleneck Disentanglement for Identity Swapping

Source code for the paper "Periodic Traveling Waves in an Integro-Difference Equation With Non-Monotonic Growth and Strong Allee Effect"