Saliency Guided Training

Code implementing "Improving Deep Learning Interpretability by Saliency Guided Training" by Aya Abdelsalam Ismail, Hector Corrada Bravo*, Soheil Feizi*.


Overview

Saliency methods have been widely used to highlight important input features in model predictions. Most existing methods use backpropagation on a modified gradient function to generate saliency maps. Thus, noisy gradients can result in unfaithful feature attributions. In this paper, we tackle this issue and introduce a saliency guided training procedure for neural networks to reduce noisy gradients used in predictions while retaining the predictive performance of the model. Our saliency guided training procedure iteratively masks features with small and potentially noisy gradients while maximizing the similarity of model outputs for both masked and unmasked inputs. We apply the saliency guided training procedure to various synthetic and real data sets from computer vision, natural language processing, and time series across diverse neural architectures, including Recurrent Neural Networks, Convolutional Networks, and Transformers. Through qualitative and quantitative evaluations, we show that the saliency guided training procedure significantly improves model interpretability across various domains while preserving its predictive performance.
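
In code, one training step looks roughly like the following PyTorch-style sketch. This is a minimal illustration, not the repository's exact implementation: the function name, the lam similarity weight, and zero-filling the masked positions are assumptions made here for clarity (the paper pairs the classification loss with a KL-divergence term between the outputs on masked and unmasked inputs).

    import torch
    import torch.nn.functional as F

    def saliency_guided_step(model, optimizer, x, y, drop_frac=0.5, lam=1.0):
        # Saliency: gradient of the classification loss w.r.t. the input.
        x = x.requires_grad_(True)
        ce = F.cross_entropy(model(x), y)
        grad, = torch.autograd.grad(ce, x)

        # Mask the drop_frac fraction of features with the smallest |gradient|
        # (zero fill here; the repo's --RandomMasking fills randomly instead).
        flat = grad.abs().flatten(1)
        k = int(drop_frac * flat.size(1))
        idx = flat.topk(k, dim=1, largest=False).indices
        x_masked = x.detach().flatten(1).clone()
        x_masked.scatter_(1, idx, 0.0)
        x_masked = x_masked.view_as(x)

        # Total loss: cross-entropy plus a KL term that pulls the outputs
        # on masked and unmasked inputs together.
        logits = model(x.detach())
        logits_masked = model(x_masked)
        kl = F.kl_div(F.log_softmax(logits_masked, dim=1),
                      F.softmax(logits, dim=1), reduction='batchmean')
        loss = F.cross_entropy(logits, y) + lam * kl

        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        return loss.item()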

Usage:

  • Create the following folder structure:

    Scripts
    ├── data
    ├── models
    └── outputs
        ├── SaliencyValues
        ├── MaskedAcc
        └── Graphs
  • To run an experiment, cd Scripts

For MNIST Regular Training

  • Run regular training: python train_MNIST.py
  • To get the accuracy drop, run: python maskedAcc_MNIST.py

For MNIST Interpretable Training

  • Add the interpretable training flags; here 50% of the features are masked during training with random masking (see the masking sketch after this list):
    python train_MNIST.py --trainingType interpretable --featuresDroped 0.5 --RandomMasking
  • To get the accuracy drop, run with the same flags used in training:
    python maskedAcc_MNIST.py --trainingType interpretable --featuresDroped 0.5 --RandomMasking
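
The difference between the default and --RandomMasking fill strategies can be sketched as below. This is a guess at the intent of the flag, not the repository's exact code; the uniform fill range is an assumption, and the repository is the authoritative source.

    import torch

    def mask_features(x, idx, random_masking=False):
        # Replace the selected low-saliency positions idx (shape: batch x k).
        flat = x.flatten(1).clone()
        if random_masking:
            # Fill with values drawn uniformly from the batch's value range
            # (assumed behavior of --RandomMasking).
            lo, hi = flat.min().item(), flat.max().item()
            fill = torch.empty(idx.shape, device=x.device).uniform_(lo, hi)
        else:
            # Default: zero out the masked positions.
            fill = torch.zeros(idx.shape, device=x.device)
        flat.scatter_(1, idx, fill)
        return flat.view_as(x)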
