Code for the paper "Overinterpretation reveals image classification model pathologies"

Last update: Dec 10, 2022

Overinterpretation

This repository contains the code for the paper:

Overinterpretation reveals image classification model pathologies
Authors: Brandon Carter, Siddhartha Jain, Jonas Mueller, David Gifford

Introduction

Image classifiers are typically scored on their test set accuracy, but high accuracy can mask a subtle type of model failure. We find that high-scoring convolutional neural networks (CNNs) on popular benchmarks exhibit troubling pathologies that allow them to display high accuracy even in the absence of semantically salient features. When a model provides a high-confidence decision without salient supporting input features, we say the classifier has overinterpreted its input, finding too much class-evidence in patterns that appear nonsensical to humans. Here, we demonstrate that neural networks trained on CIFAR-10 and ImageNet suffer from overinterpretation, and we find that models on CIFAR-10 make confident predictions even when 95% of each input image is masked and humans cannot discern salient features in the remaining pixel-subsets. Although these patterns portend potential model fragility in real-world deployment, they are in fact valid statistical patterns of the benchmark that alone suffice to attain high test accuracy. Unlike adversarial examples, overinterpretation relies upon unmodified image pixels. We find that ensembling and input dropout can each help mitigate overinterpretation.

Usage

Dependencies

Python 3.7
PyTorch v1.5.0
torchvision v0.5.0

Full requirements in requirements.txt.

Overview

The overinterpretation pipeline can be understood as:

1. Train models on full images (train.py).
2. Run backward selection for all training and test images (run_sis_on_cifar.py).
3. Train new models on pixel-subsets of images and mask the remaining pixels (train.py).
4. Evaluate new models and compare accuracy to original models.
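
As a rough illustration of what the backward selection step computes, here is a minimal sketch of plain greedy backward selection on a toy scoring function. It is not the code in run_sis_on_cifar.py, which may use a different (for example batched or gradient-based) procedure; the names backward_selection, score_fn, and toy_score are hypothetical and exist only for this example.

```python
import numpy as np

def backward_selection(x, score_fn, mask_value=0.0):
    """Greedy backward selection over the features of a 1-D input.

    At each step, mask the remaining feature whose removal leaves the
    score highest. Returns feature indices in removal order, so the
    last entries are the features the score depends on most.
    """
    x = np.asarray(x, dtype=float)
    masked = x.copy()
    remaining = list(range(x.size))
    removal_order = []
    while remaining:
        best_idx, best_score = None, -np.inf
        for i in remaining:
            trial = masked.copy()
            trial[i] = mask_value  # tentatively mask feature i
            s = score_fn(trial)
            if s > best_score:
                best_idx, best_score = i, s
        masked[best_idx] = mask_value  # commit the least harmful removal
        remaining.remove(best_idx)
        removal_order.append(best_idx)
    return removal_order

# Toy stand-in for a classifier's confidence (an assumption, not a real model).
weights = np.array([0.1, 2.0, 0.5, 1.0])

def toy_score(v):
    return float(weights @ v)

order = backward_selection(np.ones(4), toy_score)
print(order)  # features with the smallest weights are removed first
```

In this toy setting the removal order follows the weights from smallest to largest, so the features that survive longest are the ones that carry the most evidence for the score.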

The relevant scripts for running this pipeline are train.py and run_sis_on_cifar.py; each contains usage examples in its docstring. train.py supports training models on full image data as well as on pixel-subsets only, selected via command-line arguments.

Note that for CIFAR-10, when training models on pixel-subsets only, we keep 5% of pixels and mask the remaining 95% with zeros.
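
The masking described above can be sketched as follows. This is a NumPy illustration rather than the repository's PyTorch code. It assumes that pixels are ranked by a backward selection removal order (least important first), that a pixel is masked across all channels at once, and that the number of kept pixels is rounded to the nearest integer. The function name keep_top_pixels and its parameters are hypothetical.

```python
import numpy as np

def keep_top_pixels(image, removal_order, keep_fraction=0.05):
    """Zero out all but the last-removed `keep_fraction` of pixels.

    image: array of shape (H, W, C).
    removal_order: flat pixel indices (into H*W) in the order they were
        removed by backward selection, least important first.
    """
    h, w, _ = image.shape
    n_keep = int(round(keep_fraction * h * w))
    # The pixels removed last are treated as the most important ones.
    keep = np.asarray(removal_order)[h * w - n_keep:]
    mask = np.zeros(h * w, dtype=bool)
    mask[keep] = True
    # Broadcast the per-pixel mask over the channel axis.
    return image * mask.reshape(h, w, 1)

# Example on a CIFAR-10-sized image with a placeholder removal order.
image = np.ones((32, 32, 3))
removal_order = np.arange(32 * 32)
subset_image = keep_top_pixels(image, removal_order)
kept_pixels = int(subset_image.any(axis=2).sum())
```

For a 32x32 image, 5% of the 1024 pixels comes to 51 kept pixels under the rounding assumed here; the repository may round differently.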

Citation

If you use our methods or code, please cite:

@inproceedings{overinterpretation,
  title={Overinterpretation reveals image classification model pathologies},
  author={Carter, Brandon and Jain, Siddhartha and Mueller, Jonas W and Gifford, David},
  booktitle={Advances in Neural Information Processing Systems},
  volume={34},
  year={2021}
}

Owner

Gifford Lab, MIT CSAIL