Normalization Calibration (NorCal) for Long-Tailed Object Detection and Instance Segmentation

Last update: Dec 25, 2022

On Model Calibration for Long-Tailed Object Detection and Instance Segmentation.

Advances in Neural Information Processing Systems (NeurIPS), 2021.

Tai-Yu Pan*, Cheng Zhang*, Yandong Li, Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao.

Introduction

Vanilla models for object detection and instance segmentation suffer from a heavy bias toward detecting frequent objects in the long-tailed setting. Existing methods address this issue mostly during training, e.g., by re-sampling or re-weighting.

In this paper, we investigate a largely overlooked approach -- post-processing calibration of confidence scores. We propose NorCal, Normalized Calibration for long-tailed object detection and instance segmentation, a simple and straightforward recipe that reweighs the predicted scores of each class by its training sample size. We show that separately handling the background class and normalizing the scores over classes for each proposal are keys to achieving superior performance. On the LVIS dataset, NorCal can effectively improve nearly all the baseline models not only on rare classes but also on common and frequent classes. Finally, we conduct extensive analysis and ablation studies to offer insights into various modeling choices and mechanisms of our approach.
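The recipe described above can be sketched for a single proposal as follows. This is a minimal illustration, not the code shipped in this repository: the function name `norcal_calibrate`, its argument layout (foreground logits first, background logit last), and the specific form of the reweighting (dividing each foreground class's exponentiated logit by its training sample count raised to the power gamma) are assumptions made for the example. The background term is left unscaled and the result is re-normalized over all classes, matching the two ingredients the paragraph names.

```python
import math


def norcal_calibrate(logits, class_counts, gamma):
    """Calibrate the class scores of one proposal (illustrative sketch).

    logits:       C foreground logits followed by one background logit.
    class_counts: C training sample counts, one per foreground class (> 0).
    gamma:        calibration strength; gamma = 0 reduces to a plain softmax.
    """
    if len(logits) != len(class_counts) + 1:
        raise ValueError("expected one logit per foreground class plus one background logit")

    # Subtract the max logit before exponentiating for numerical stability;
    # the shift cancels out in the final normalization.
    shift = max(logits)
    exps = [math.exp(x - shift) for x in logits]

    # Down-weight each foreground class by its training sample size.
    foreground = [e / (n ** gamma) for e, n in zip(exps[:-1], class_counts)]

    # The background class is handled separately: it is not reweighted.
    background = exps[-1]

    # Re-normalize over all classes so the scores of this proposal sum to 1.
    total = sum(foreground) + background
    return [v / total for v in foreground] + [background / total]


if __name__ == "__main__":
    # Two foreground classes with equal logits: a frequent one (1000 samples)
    # and a rare one (10 samples), plus a background logit.
    print(norcal_calibrate([2.0, 2.0, 0.0], [1000, 10], gamma=0.5))
```

With equal logits, the rare class ends up with the higher calibrated score, which is the intended effect of reweighting by sample size. Setting gamma to 0 disables the calibration and returns the ordinary softmax scores.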

Installation

Install Detectron2 by following its official installation instructions.

Evaluation

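To evaluate a model, run the following, replacing /path/to/model_checkpoint with the checkpoint to evaluate and gamma with the calibration value to use for TEST.CALIBRATION.GAMMA: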

cd /path/to/detectron2/projects/NorCal
python train_net.py --config-file configs/lvis_v0.5_mask_rcnn_R_50_FPN.yaml --eval-only MODEL.WEIGHTS /path/to/model_checkpoint TEST.CALIBRATION.GAMMA gamma

Citation

If you find this work useful, please cite it with the following BibTeX entry.

@inproceedings{pan2021norcal,
  title={On Model Calibration for Long-Tailed Object Detection and Instance Segmentation},
  author={Pan, Tai-Yu and Zhang, Cheng and Li, Yandong and Hu, Hexiang and Xuan, Dong and Changpinyo, Soravit and Gong, Boqing and Chao, Wei-Lun},
  booktitle={NeurIPS},
  year={2021}
}

Owner

Tai-Yu (Daniel) Pan
