Changelog

Implementation of "Distribution Alignment: A Unified Framework for Long-tail Visual Recognition"(CVPR 2021)

We implement the classification, object detection and instance segmentation tasks based on our cvpods. The users should install cvpods first and run the experiments in this repo.

Changelog

7.28.2021 Update the DisAlign on ImageNet-LT(ResX50)
4.23.2021 Update the DisAlign on LVIS v0.5(Mask R-CNN + Res50)
4.12.2021 Update the README

0. How to Use

Step-1: Install the latest cvpods.
Step-2: cd cvpods
Step-3: Prepare dataset for different tasks.
Step-4: git clone https://github.com/Megvii-BaseDetection/DisAlign playground_disalign
Step-5: Enter one folder and run pods_train --num-gpus 8
Step-6: Use pods_test --num-gpus 8 to evaluate the last the checkpoint

1. Image Classification

We support the the following three datasets:

ImageNet-LT Dataset
iNaturalist-2018 Dataset
Place-LT Dataset

We refer the user to CLS_README for more details.

2. Object Detection/Instance Segmentation

We support the two versions of the LVIS dataset:

LVIS v0.5
LVIS v1.0

Highlight

To speedup the evaluation on LVIS dataset, we provide the C++ optimized evaluation api by modifying the coco_eval(C++) in cvpods.

The C++ version lvis_eval API will save ~30% time when calculating the mAP.

We provide support for the metric of AP_fixed and AP_pool proposed in large-vocab-devil
We will support more recent works on long-tail detection in this project(e.g. EQLv2, CenterNet2, etc.) in the future.

We refer the user to DET_README for more details.

3. Semantic Segmentation

We adopt the mmsegmentation as the codebase for runing all experiments of DisAlign. Currently, the user should use DisAlign_Seg for the semantic segmentation experiments. We will add the support for these experiments in cvpods in the future.

Acknowledgement

Thanks for the following projects:

Citing DisAlign

If you are using the DisAlign in your research or with to refer to the baseline results publised in this repo, please use the following BibTex entry.

@inproceedings{zhang2021disalign,
  title={Distribution Alignment: A Unified Framework for Long-tail Visual Recognition.},
  author={Zhang, Songyang and Li, Zeming and Yan, Shipeng and He, Xuming and Sun, Jian},
  booktitle={CVPR},
  year={2021}
}

License

This repo is released under the Apache 2.0 license. Please see the LICENSE file for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.src		.src
classification		classification
instance_seg		instance_seg
semantic_seg		semantic_seg
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.src

.src

classification

classification

instance_seg

instance_seg

semantic_seg

semantic_seg

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

Repository files navigation

Changelog

0. How to Use

1. Image Classification

2. Object Detection/Instance Segmentation

3. Semantic Segmentation

Acknowledgement

Citing DisAlign

License

About

Releases 1

Packages

Languages

License

Megvii-BaseDetection/DisAlign

Folders and files

Latest commit

History

Repository files navigation

Changelog

0. How to Use

1. Image Classification

2. Object Detection/Instance Segmentation

3. Semantic Segmentation

Acknowledgement

Citing DisAlign

License

About

Resources

License

Stars

Watchers

Forks

Languages