Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" 🧠 (ICLR 2019)

Last update: Jan 03, 2023

Overview

Hierarchical neural-net interpretations (ACD) 🧠

Produces hierarchical interpretations for a single prediction made by a pytorch neural network. Official code for Hierarchical interpretations for neural network predictions (ICLR 2019 pdf).

Documentation • Demo notebooks

Note: this repo is actively maintained. For any questions please file an issue.

examples/documentation

installation: pip install acd (or clone and run python setup.py install)
examples: the reproduce_figs folder has notebooks with many demos
src: the acd folder contains the source for the method implementation
allows for different types of interpretations by changing hyperparameters (explained in examples)
all required data/models/code for reproducing are included in the dsets folder

Inspecting NLP sentiment models	Detecting adversarial examples	Analyzing imagenet models

notes on using ACD on your own data

the current CD implementation often works out-of-the box, especially for networks built on common layers, such as alexnet/vgg/resnet. However, if you have custom layers or layers not accessible in net.modules(), you may need to write a custom function to iterate through some layers of your network (for examples see cd.py).
to use baselines such build-up and occlusion, replace the pred_ims function by a function, which gets predictions from your model given a batch of examples.

related work

CDEP (ICML 2020 pdf, github) - penalizes CD / ACD scores during training to make models generalize better
TRIM (ICLR 2020 workshop pdf, github) - using simple reparameterizations, allows for calculating disentangled importances to transformations of the input (e.g. assigning importances to different frequencies)
PDR framework (PNAS 2019 pdf) - an overarching framewwork for guiding and framing interpretable machine learning
DAC (arXiv 2019 pdf, github) - finds disentangled interpretations for random forests
Baseline interpretability methods - the file scores/score_funcs.py also contains simple pytorch implementations of integrated gradients and the simple interpration technique gradient * input

reference

feel free to use/share this code openly
if you find this code useful for your research, please cite the following:

@inproceedings{
   singh2019hierarchical,
   title={Hierarchical interpretations for neural network predictions},
   author={Chandan Singh and W. James Murdoch and Bin Yu},
   booktitle={International Conference on Learning Representations},
   year={2019},
   url={https://openreview.net/forum?id=SkEqro0ctQ},
}

Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" 🧠 (ICLR 2019)

Related tags

Overview

Hierarchical neural-net interpretations (ACD) 🧠

examples/documentation

notes on using ACD on your own data

related work

reference

Owner

Chandan Singh

A python library for decision tree visualization and model interpretation.

Visualizer for neural network, deep learning, and machine learning models

Many Class Activation Map methods implemented in Pytorch for CNNs and Vision Transformers. Including Grad-CAM, Grad-CAM++, Score-CAM, Ablation-CAM and XGrad-CAM

Implementation of linear CorEx and temporal CorEx.

Visual analysis and diagnostic tools to facilitate machine learning model selection.

Tool for visualizing attention in the Transformer model (BERT, GPT-2, Albert, XLNet, RoBERTa, CTRL, etc.)

A library for debugging/inspecting machine learning classifiers and explaining their predictions

Code for visualizing the loss landscape of neural nets

A library that implements fairness-aware machine learning algorithms

Net2Vis automatically generates abstract visualizations for convolutional neural networks from Keras code.

Contrastive Explanation (Foil Trees), developed at TNO/Utrecht University

L2X - Code for replicating the experiments in the paper Learning to Explain: An Information-Theoretic Perspective on Model Interpretation.

A game theoretic approach to explain the output of any machine learning model.

Neural network visualization toolkit for tf.keras

ModelChimp is an experiment tracker for Deep Learning and Machine Learning experiments.

tensorboard for pytorch (and chainer, mxnet, numpy, ...)

GNNLens2 is an interactive visualization tool for graph neural networks (GNN).

👋🦊 Xplique is a Python toolkit dedicated to explainability, currently based on Tensorflow.

Portal is the fastest way to load and visualize your deep neural networks on images and videos 🔮

A ultra-lightweight 3D renderer of the Tensorflow/Keras neural network architectures