This repository contains code for the paper "Disentangling Label Distribution for Long-tailed Visual Recognition", published at CVPR 2021.

Last update: Oct 18, 2022

Overview

Disentangling Label Distribution for Long-tailed Visual Recognition (CVPR 2021)

Arxiv link
Blog post
This codebase is built on Causal Norm.

Install

conda create -n longtail pip python=3.7 -y
conda activate longtail
conda install pytorch torchvision cudatoolkit=10.1 -c pytorch
pip install pyyaml tqdm matplotlib scikit-learn h5py tensorboard

Training

Preliminaries

- Download the pretrained Caffe ResNet-152 model for Places-LT: please refer to link.
- Prepare the datasets: CIFAR-100, Places-LT, ImageNet-LT, and iNaturalist 2018.
  - Please download these datasets following Decoupling.

CIFAR-100 training

For CIFAR-100 with imbalance ratio 0.01, using LADE:

python main.py --seed 1 --cfg config/CIFAR100_LT/lade.yaml --exp_name lade2021/cifar100_imb0.01_lade --cifar_imb_ratio 0.01 --remine_lambda 0.01 --alpha 0.1 --gpu 0
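
To make the imbalance ratio concrete, here is a minimal sketch of the exponential class-size profile commonly used to build long-tailed CIFAR splits. That this repository builds its split the same way is an assumption; the function name and the 500 images per class for the head class are illustrative choices, not taken from this codebase.

```python
def long_tailed_class_counts(num_classes=100, max_per_class=500, imb_ratio=0.01):
    """Per-class training counts under an exponential long-tailed profile.

    Assumption: class 0 is the head class and the last class is the tail,
    with the tail holding roughly max_per_class * imb_ratio samples.
    """
    if num_classes < 2:
        raise ValueError("need at least two classes to define a profile")
    counts = []
    for class_idx in range(num_classes):
        # The exponent runs from 0 (head) to 1 (tail), so the count decays
        # geometrically from max_per_class down to max_per_class * imb_ratio.
        fraction = class_idx / (num_classes - 1)
        counts.append(int(max_per_class * imb_ratio ** fraction))
    return counts


counts = long_tailed_class_counts()
print("head:", counts[0], "tail:", counts[-1], "total:", sum(counts))
```

Under this profile, an imbalance ratio of 0.01 means the rarest class has about one hundredth as many training images as the most frequent one.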

Places-LT training

For PC Softmax:

python main.py --seed 1 --cfg config/Places_LT/ce.yaml --exp_name lade2021/places_pc_softmax --lr 0.05 --gpu 0,1,2,3
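
The command above uses the cross-entropy config, which fits a method that trains normally and only corrects predictions at inference time. As a rough illustration of that idea, the sketch below shifts each logit by the log ratio of target to source label prior before applying softmax. The function and variable names are hypothetical, and the repository's actual implementation may differ in detail.

```python
import math


def pc_softmax(logits, train_prior, test_prior):
    """Post-compensated softmax sketch: move logits from the train prior to the test prior."""
    adjusted = [
        z - math.log(p_s) + math.log(p_t)
        for z, p_s, p_t in zip(logits, train_prior, test_prior)
    ]
    # Numerically stable softmax over the adjusted logits.
    peak = max(adjusted)
    exps = [math.exp(a - peak) for a in adjusted]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.5]             # raw scores from a model trained on skewed data
train_prior = [0.7, 0.2, 0.1]        # assumed long-tailed training label distribution
test_prior = [1 / 3, 1 / 3, 1 / 3]   # assumed uniform test label distribution

plain = pc_softmax(logits, train_prior, train_prior)  # no shift: ordinary softmax
probs = pc_softmax(logits, train_prior, test_prior)
print("plain:", plain)
print("compensated:", probs)
```

When the two priors are equal the correction cancels and the result is the ordinary softmax; when the test prior is flatter than the training prior, probability mass moves toward the tail classes.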

For LADE:

python main.py --seed 1 --cfg config/Places_LT/lade.yaml --exp_name lade2021/places_lade --lr 0.05 --remine_lambda 0.1 --alpha 0.005 --gpu 0,1,2,3

ImageNet-LT training

For LADE:

python main.py --seed 1 --cfg config/ImageNet_LT/lade.yaml --exp_name lade2021/imagenet_lade --lr 0.05 --remine_lambda 0.5 --alpha 0.05 --gpu 0,1,2,3

iNaturalist18 training

For LADE:

python main.py --seed 1 --cfg ./config/iNaturalist18/lade.yaml --exp_name lade2021/inat_lade --lr 0.1 --alpha 0.05 --gpu 0,1,2,3

Evaluate on shifted test set & Confidence calibration

For ImageNet (Sections 4.3 and 4.4 of the paper):

./notebooks/imagenet-shift-calib.ipynb

For CIFAR-100 (Supplementary material):

./notebooks/cifar100-shift-calib.ipynb
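
For readers unfamiliar with calibration metrics, the following is a minimal sketch of expected calibration error (ECE), a common way to quantify confidence calibration. Whether the notebooks compute it exactly this way is an assumption; the function name, the 10 equal-width bins, and the toy inputs are illustrative.

```python
def expected_calibration_error(confidences, correct, num_bins=10):
    """ECE sketch: bin-size-weighted mean of |accuracy - confidence| over equal-width bins."""
    bins = [[] for _ in range(num_bins)]
    for conf, hit in zip(confidences, correct):
        # Bins are [k/num_bins, (k+1)/num_bins); a confidence of 1.0 goes in the last bin.
        idx = min(int(conf * num_bins), num_bins - 1)
        bins[idx].append((conf, hit))

    total = len(confidences)
    ece = 0.0
    for members in bins:
        if not members:
            continue  # empty bins contribute nothing
        avg_conf = sum(c for c, _ in members) / len(members)
        accuracy = sum(1 for _, h in members if h) / len(members)
        ece += len(members) / total * abs(accuracy - avg_conf)
    return ece


# Toy example: an overconfident model that is right only once in four predictions.
overconfident = expected_calibration_error([0.9, 0.9, 0.9, 0.9], [True, False, False, False])
print("ECE:", overconfident)
```

A value near zero means the predicted confidence tracks the observed accuracy; larger values indicate over- or under-confidence.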

License

This software is released under the BSD-3 license.

Citation

If you find our paper or this project helpful for your research, please consider citing our paper in your publications.

@article{hong2020disentangling,
  title={Disentangling Label Distribution for Long-tailed Visual Recognition},
  author={Hong, Youngkyu and Han, Seungju and Choi, Kwanghee and Seo, Seokjun and Kim, Beomsu and Chang, Buru},
  journal={arXiv preprint arXiv:2012.00321},
  year={2020}
}

Owner

Hyperconnect
