Official PyTorch implementation of Rainbow Memory (CVPR 2021)

Last update: Dec 17, 2022

Rainbow Memory - Official PyTorch Implementation

Rainbow Memory: Continual Learning with a Memory of Diverse Samples
Jihwan Bang^*, Heesu Kim^*, YoungJoon Yoo, Jung-Woo Ha, Jonghyun Choi
CVPR 2021
Paper | Bibtex
(* indicates equal contribution)

NOTE: The code will be pushed to this repository soon.

Abstract

Continual learning is a realistic learning scenario for AI models. The prevalent continual learning scenario, however, assumes disjoint sets of classes as tasks, which is artificial rather than realistic. Instead, we focus on 'blurry' task boundaries, where tasks share classes, which is more realistic and practical. To address such tasks, we argue for the importance of sample diversity in an episodic memory. To enhance the sample diversity in the memory, we propose a novel memory management strategy based on per-sample classification uncertainty and data augmentation, named Rainbow Memory (RM). With extensive empirical validation on the MNIST, CIFAR10, CIFAR100, and ImageNet datasets, we show that the proposed method significantly improves accuracy in blurry continual learning setups, outperforming state-of-the-art methods by large margins despite its simplicity.

Overview of the results of RM

The table below compares the last accuracy of each method on several datasets in the Blurry10-Online setting. See the paper for more details.

Methods	MNIST	CIFAR100	ImageNet
EWC	90.98±0.61	26.95±0.36	39.54
Rwalk	90.69±0.62	32.31±0.78	35.26
iCaRL	78.09±0.60	17.39±1.04	17.52
GDumb	88.51±0.52	27.19±0.65	21.52
BiC	77.75±1.27	13.01±0.24	37.20
RM w/o DA	92.65±0.33	34.09±1.41	37.96
RM	91.80±0.69	41.35±0.95	50.11

Updates

April 2nd, 2021: Initial upload (README only)
April 16th, 2021: Uploaded all code for the experiments

Getting Started

Requirements

Python3
PyTorch (>1.0)
torchvision (>0.2)
numpy
pillow~=6.2.1
torch_optimizer
randaugment
easydict
pandas~=1.1.3
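
Before running anything, it can help to confirm that each dependency is importable. The sketch below is not part of the repository. The import names in ASSUMED_MODULES are assumptions (for example, PIL for pillow and randaugment for the randaugment package) and may need adjusting for your environment.

```python
import importlib.util

# Assumed import names for the packages listed above (not taken from the repository).
ASSUMED_MODULES = ["torch", "torchvision", "numpy", "PIL",
                   "torch_optimizer", "randaugment", "easydict", "pandas"]


def find_missing(modules):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(ASSUMED_MODULES)
    print("missing:", ", ".join(missing) if missing else "none")
```

This only checks that a module can be located; it does not verify the version constraints listed above.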

Datasets

All datasets are stored in the dataset directory, using the layout below.

[dataset name] 
    |_train
        |_[class1 name]
            |_00001.png
            |_00002.png 
            ...
        |_[class2 name]
            ... 
    |_test (val for ImageNet)
        |_[class1 name]
            |_00001.png
            |_00002.png
            ...
        |_[class2 name]
            ...

The following repositories provide the datasets already converted to this layout.

MNIST: https://github.com/hwany-j/mnist_png
CIFAR-10: https://github.com/hwany-j/cifar10_png
CIFAR-100: https://github.com/hwany-j/cifar100_png

For ImageNet, download the dataset from its official public site.
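
To check that a downloaded dataset matches the directory layout described in this section, a small helper like the one below can count the images in each class folder. This is a minimal sketch and not part of the repository: the function name count_images, the set of accepted file extensions, and the example path dataset/cifar10 are all assumptions.

```python
from pathlib import Path

IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}  # assumed set of image extensions


def count_images(dataset_root, split):
    """Map each class folder under <dataset_root>/<split> to its image count."""
    split_dir = Path(dataset_root) / split
    if not split_dir.is_dir():
        raise FileNotFoundError(f"missing split directory: {split_dir}")
    counts = {}
    # Every sub-directory of the split is treated as one class.
    for class_dir in sorted(p for p in split_dir.iterdir() if p.is_dir()):
        counts[class_dir.name] = sum(
            1 for f in class_dir.iterdir()
            if f.is_file() and f.suffix.lower() in IMAGE_SUFFIXES
        )
    return counts


if __name__ == "__main__":
    # Hypothetical location; use "val" instead of "test" for ImageNet.
    print(count_images("dataset/cifar10", "train"))
```

A class with zero images, or a missing train or test directory, usually means the archive was unpacked one level too deep or too shallow.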

Usage

To run the experiments from the paper, run experiment.sh:

bash experiment.sh

To run other configurations, you need to know the role of each argument.

MODE: Class-incremental learning (CIL) method. Our method is rm. Choices: [joint, gdumb, icarl, rm, ewc, rwalk, bic] (joint reports the accuracy obtained by training on all the data at once.)
MEM_MANAGE: Memory management method. default uses the memory management that the method's paper originally used. Choices: [default, random, reservoir, uncertainty, prototype]
RND_SEED: Random seed
DATASET: Dataset name. Choices: [mnist, cifar10, cifar100, imagenet]
STREAM: Whether the current task's data can be seen iteratively or not. Choices: [online, offline]
EXP: Task setup. Choices: [disjoint, blurry10, blurry30]
MEM_SIZE: Memory size k. cifar10: k={200, 500, 1000}, mnist: k=500, cifar100: k=2,000, imagenet: k=20,000
TRANS: Data augmentation. Multiple choices may be combined from [cutmix, cutout, randaug, autoaug]
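
Because most of these arguments accept only a fixed set of values, it can be useful to validate a configuration before launching a long run. The sketch below is hypothetical and not taken from the repository: the allowed values are copied from the list above, but the dictionary layout, the validate function, and the example configuration are assumptions made for illustration.

```python
# Allowed values copied from the argument list above; the dict layout is hypothetical.
CHOICES = {
    "MODE": {"joint", "gdumb", "icarl", "rm", "ewc", "rwalk", "bic"},
    "MEM_MANAGE": {"default", "random", "reservoir", "uncertainty", "prototype"},
    "DATASET": {"mnist", "cifar10", "cifar100", "imagenet"},
    "STREAM": {"online", "offline"},
    "EXP": {"disjoint", "blurry10", "blurry30"},
}
TRANS_CHOICES = {"cutmix", "cutout", "randaug", "autoaug"}


def validate(config):
    """Return a list of human-readable problems found in `config` (empty if none)."""
    problems = []
    for key, allowed in CHOICES.items():
        if config.get(key) not in allowed:
            problems.append(f"{key}={config.get(key)!r} not in {sorted(allowed)}")
    # TRANS may hold several augmentations at once.
    unknown = set(config.get("TRANS", [])) - TRANS_CHOICES
    if unknown:
        problems.append(f"TRANS has unknown entries: {sorted(unknown)}")
    if not isinstance(config.get("MEM_SIZE"), int) or config["MEM_SIZE"] <= 0:
        problems.append("MEM_SIZE must be a positive integer")
    return problems


if __name__ == "__main__":
    # Illustrative combination only; not a setting prescribed by the paper.
    example = {"MODE": "rm", "MEM_MANAGE": "uncertainty", "RND_SEED": 1,
               "DATASET": "cifar10", "STREAM": "online", "EXP": "blurry10",
               "MEM_SIZE": 500, "TRANS": ["cutmix", "autoaug"]}
    print(validate(example) or "configuration looks valid")
```

The check does not enforce the per-dataset memory sizes listed under MEM_SIZE, since other sizes may still be valid.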

Results

Running an experiment produces three kinds of output: logs, results, and TensorBoard data. Log files are saved in the logs directory, and results, which contain the accuracy of each task, are saved in the results directory.

root_directory
    |_ logs 
        |_ [dataset]
            |_{mode}_{mem_manage}_{stream}_msz{k}_rnd{seed_num}_{trans}.log
            |_ ...
    |_ results
        |_ [dataset]
            |_{mode}_{mem_manage}_{stream}_msz{k}_rnd{seed_num}_{trans}.npy
            |_...
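
The naming pattern above can be reproduced in code to locate the output of a particular run. The following is a minimal sketch, not the repository's own code: the helper name run_paths is hypothetical, and it assumes that trans is passed as an already formatted string, since the way multiple augmentations are joined in the file name is not specified here.

```python
from pathlib import Path


def run_paths(root, dataset, mode, mem_manage, stream, k, seed_num, trans):
    """Build the log and result paths that follow the naming pattern above."""
    stem = f"{mode}_{mem_manage}_{stream}_msz{k}_rnd{seed_num}_{trans}"
    root = Path(root)
    return (root / "logs" / dataset / f"{stem}.log",
            root / "results" / dataset / f"{stem}.npy")


if __name__ == "__main__":
    log_path, result_path = run_paths(
        "root_directory", "cifar10", "rm", "uncertainty", "online", 500, 1, "cutmix")
    print(log_path)
    print(result_path)
```

The .npy result file can then be read with numpy.load.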

You can also view the TensorBoard output with the following command.

tensorboard --logdir tensorboard

Citation

@inproceedings{jihwan2021rainbow,
  title={Rainbow Memory: Continual Learning with a Memory of Diverse Samples},
  author={Jihwan Bang and Heesu Kim and YoungJoon Yoo and Jung-Woo Ha and Jonghyun Choi},
  booktitle={CVPR},
  month={June},
  year={2021}
}

License

Copyright 2021-present NAVER Corp.

This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License
along with this program.  If not, see <https://www.gnu.org/licenses/>.

Owner

Clova AI Research
