Learning Confidence for Out-of-Distribution Detection in Neural Networks

Last update: Jan 05, 2023

Overview

Learning Confidence Estimates for Neural Networks

This repository contains the code for the paper Learning Confidence for Out-of-Distribution Detection in Neural Networks. In this work, we demonstrate how to augment neural networks with a confidence estimation branch, which can be used to identify misclassified and out-of-distribution examples.

To learn confidence estimates during training, we provide the neural network with "hints" towards the correct output whenever it exhibits low confidence in its predictions. Hints are provided by pushing the prediction closer to the target distribution via interpolation, where the degree of interpolation is determined by the network's confidence that its prediction is correct: the lower the confidence, the stronger the hint. To discourage the network from always asking for free hints, a small penalty is applied whenever it is not confident. As a result, the network learns to only produce low confidence estimates when it is likely to make an incorrect prediction.
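The hinting scheme described above can be sketched for a single example as follows. This is a minimal illustration in plain Python, not the code from train.py: the function name, the `lmbda` penalty weight, and the `eps` clamp are assumptions made for the example, and the loss form (negative log-likelihood on the interpolated prediction plus a `-log(confidence)` penalty) follows the description in the paper.

```python
import math

def confidence_loss(probs, target, confidence, lmbda=0.1, eps=1e-12):
    """Hinted task loss plus confidence penalty for one example (illustrative).

    probs      -- softmax probabilities predicted by the network
    target     -- index of the correct class
    confidence -- scalar in [0, 1] produced by the confidence branch
    lmbda      -- weight of the confidence penalty (hypothetical default)
    """
    # Interpolate between the prediction and the one-hot target.
    # Low confidence moves the prediction further towards the target.
    hinted = [
        confidence * p + (1.0 - confidence) * (1.0 if i == target else 0.0)
        for i, p in enumerate(probs)
    ]
    # Negative log-likelihood of the correct class under the hinted prediction.
    task_loss = -math.log(max(hinted[target], eps))
    # Penalty that grows as the network asks for more hints.
    penalty = -math.log(max(confidence, eps))
    return task_loss + lmbda * penalty, task_loss, penalty

# A wrong prediction (class 0 favoured, class 1 correct).
probs = [0.7, 0.2, 0.1]
total_full, task_full, pen_full = confidence_loss(probs, target=1, confidence=1.0)
total_half, task_half, pen_half = confidence_loss(probs, target=1, confidence=0.5)
```

With full confidence the loss reduces to the ordinary negative log-likelihood. With lower confidence the task loss drops because of the hint, but the penalty term becomes positive, which is the trade-off the network has to learn.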

Bibtex:

@article{devries2018learning,
  title={Learning Confidence for Out-of-Distribution Detection in Neural Networks},
  author={DeVries, Terrance and Taylor, Graham W},
  journal={arXiv preprint arXiv:1802.04865},
  year={2018}
}

Results and Usage

We evaluate our method on the task of out-of-distribution detection using three different neural network architectures: DenseNet, WideResNet, and VGG. CIFAR-10 and SVHN are used as the in-distribution datasets, while TinyImageNet, LSUN, iSUN, uniform noise, and Gaussian noise are used as the out-of-distribution datasets. Definitions of evaluation metrics can be found in the paper.
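As a rough illustration of how a detection score can be evaluated, the sketch below computes AUROC, a metric commonly reported for out-of-distribution detection. It is a plain-Python example and not the repository's evaluation code; the function name and the sample scores are made up, and the paper remains the reference for the exact metric definitions.

```python
def auroc(in_scores, out_scores):
    """Probability that a random in-distribution sample receives a higher
    score than a random out-of-distribution sample (ties count as 0.5)."""
    wins = 0.0
    for s_in in in_scores:
        for s_out in out_scores:
            if s_in > s_out:
                wins += 1.0
            elif s_in == s_out:
                wins += 0.5
    return wins / (len(in_scores) * len(out_scores))

# Hypothetical confidence scores for a handful of samples.
in_scores = [0.9, 0.3]
out_scores = [0.5, 0.1]
result = auroc(in_scores, out_scores)
```

This pairwise form is quadratic in the number of samples, so it is only meant to make the definition concrete. For real evaluations a library routine such as the one in scikit-learn is the more practical choice.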

Dependencies

PyTorch v0.3.0
tqdm
visdom
seaborn
Pillow
scikit-learn

Training

Train a model with a confidence estimator with train.py. During training you can use visdom to see a histogram of confidence estimates from the test set. Training logs will be stored in the logs/ folder, while checkpoints are stored in the checkpoints/ folder.

Args	Options	Description
dataset	cifar10, svhn	Selects which dataset to train on.
model	densenet, wideresnet, vgg13	Selects which model architecture to use.
batch_size	[int]	Number of samples per batch.
epochs	[int]	Number of epochs for training.
seed	[int]	Random seed.
learning_rate	[float]	Learning rate.
data_augmentation		Train with standard data augmentation (random flipping and translation).
cutout	[int]	Indicates the patch size to use for Cutout. If 0, Cutout is not used.
budget	[float]	Controls how often the network can choose to have low confidence in its prediction. Increasing the budget will bias the output towards low confidence predictions, while decreasing the budget will produce more high confidence predictions.
baseline		Train the model without the confidence branch.
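One way to picture the effect of the budget argument is as a target for the average confidence penalty, with the penalty weight nudged up or down after each update so the penalty tends towards that target. The sketch below assumes this mechanism, as described in the paper; the function name, the `step` size, and the starting weight are illustrative and may differ from what train.py does.

```python
def update_lambda(lmbda, conf_penalty, budget, step=0.01):
    """Adjust the penalty weight so the confidence penalty tracks the budget.

    lmbda        -- current weight of the confidence penalty
    conf_penalty -- current average of -log(confidence) over a batch
    budget       -- target value for the penalty (the --budget argument)
    step         -- relative adjustment per update (hypothetical value)
    """
    if conf_penalty > budget:
        # The network asks for hints too often: make hints more expensive.
        return lmbda / (1.0 - step)
    # The network is under budget: make hints cheaper.
    return lmbda / (1.0 + step)

raised = update_lambda(0.1, conf_penalty=0.5, budget=0.3)
lowered = update_lambda(0.1, conf_penalty=0.1, budget=0.3)
```

Under this assumption a larger budget tolerates a larger penalty, and therefore lower confidence outputs, which matches the behaviour described for the budget argument.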

Use the following settings to replicate the experiments from the paper:

VGG13 on CIFAR-10

python train.py --dataset cifar10 --model vgg13 --budget 0.3 --data_augmentation --cutout 16

WideResNet on CIFAR-10

python train.py --dataset cifar10 --model wideresnet --budget 0.3 --data_augmentation --cutout 16

DenseNet on CIFAR-10

python train.py --dataset cifar10 --model densenet --budget 0.3 --epochs 300 --batch_size 64 --data_augmentation --cutout 16

VGG13 on SVHN

python train.py --dataset svhn --model vgg13 --budget 0.3 --learning_rate 0.01 --epochs 160 --data_augmentation --cutout 20

WideResNet on SVHN

python train.py --dataset svhn --model wideresnet --budget 0.3 --learning_rate 0.01 --epochs 160 --data_augmentation --cutout 20

DenseNet on SVHN

python train.py --dataset svhn --model densenet --budget 0.3 --learning_rate 0.01 --epochs 300 --batch_size 64  --data_augmentation --cutout 20

Out-of-distribution detection

Evaluate a trained model with out_of_distribution_detection.py. Before running this you will need to download the out-of-distribution datasets from Shiyu Liang's ODIN GitHub repo and modify the data paths in the file according to where you saved the datasets.

Args	Options	Description
ind_dataset	cifar10, svhn	Indicates which dataset to use as in-distribution. Should be the same one that the model was trained on.
ood_dataset	tinyImageNet_crop, tinyImageNet_resize, LSUN_crop, LSUN_resize, iSUN, Uniform, Gaussian, all	Indicates which dataset to use as the out-of-distribution dataset.
model	densenet, wideresnet, vgg13	Selects which model architecture to use. Should be the same one that the model was trained on.
process	baseline, ODIN, confidence, confidence_scaling	Indicates which method to use for out-of-distribution detection. Baseline uses the maximum softmax probability. ODIN applies temperature scaling and input pre-processing to the baseline method. Confidence uses the learned confidence estimates. Confidence scaling applies input pre-processing to the confidence estimates.
batch_size	[int]	Number of samples per batch.
T	[float]	Temperature to use for temperature scaling.
epsilon	[float]	Noise magnitude to use for input pre-processing.
checkpoint	[str]	Filename of trained model checkpoint. Assumes the file is in the checkpoints/ folder. A .pt extension is also automatically added to the filename.
validation		Use this flag for fine-tuning T and epsilon. If flag is on, the script will only evaluate on the first 1000 samples in the out-of-distribution dataset. If flag is not used, the remaining samples are used for evaluation. Based on validation procedure from ODIN.
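To make the baseline and temperature options more concrete, the sketch below computes the maximum softmax probability for one sample, optionally with temperature scaling, and compares it to a threshold. It is a plain-Python illustration rather than the code in out_of_distribution_detection.py: the function names, the example logits, and the threshold are hypothetical, and the input pre-processing step used by ODIN is left out because it requires gradients of the model.

```python
import math

def max_softmax_score(logits, T=1.0):
    """Maximum softmax probability after dividing the logits by temperature T."""
    scaled = [z / T for z in logits]
    m = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    return max(exps) / sum(exps)

def is_in_distribution(score, threshold):
    """Treat a sample as in-distribution when its score reaches the threshold."""
    return score >= threshold

logits = [2.0, 1.0, 0.0]
score_plain = max_softmax_score(logits)           # baseline, T = 1
score_scaled = max_softmax_score(logits, T=1000)  # temperature scaled
```

A high temperature flattens the softmax, so the scaled score moves towards 1 / number of classes. Any threshold therefore has to be chosen for the specific T in use, which is what the validation flag is intended to support.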

Example commands for running the out-of-distribution detection script:

Baseline

python out_of_distribution_detection.py --ind_dataset svhn --ood_dataset all --model vgg13 --process baseline --checkpoint svhn_vgg13_budget_0.0_seed_0

ODIN

python out_of_distribution_detection.py --ind_dataset cifar10 --ood_dataset tinyImageNet_resize --model densenet --process ODIN --T 1000 --epsilon 0.001 --checkpoint cifar10_densenet_budget_0.0_seed_0

Confidence

python out_of_distribution_detection.py --ind_dataset cifar10 --ood_dataset LSUN_crop --model vgg13 --process confidence --checkpoint cifar10_vgg13_budget_0.3_seed_0

Confidence scaling

python out_of_distribution_detection.py --ind_dataset svhn --ood_dataset iSUN --model wideresnet --process confidence_scaling --epsilon 0.001 --checkpoint svhn_wideresnet_budget_0.3_seed_0
