SnapMix: Semantically Proportional Mixing for Augmenting Fine-grained Data (AAAI 2021)

Last update: Dec 30, 2022

PyTorch implementation of SnapMix | paper

Method Overview

Cite

@inproceedings{huang2021snapmix,
    title={SnapMix: Semantically Proportional Mixing for Augmenting Fine-grained Data},
    author={Shaoli Huang and Xinchao Wang and Dacheng Tao},
    year={2021},
    booktitle={AAAI Conference on Artificial Intelligence},
}

Setup

Install Package Dependencies

torch
torchvision 
PyYAML
easydict
tqdm
scikit-learn
efficientnet_pytorch
pandas
opencv
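
A quick way to confirm the environment is ready is to check that each dependency can be found by Python. The sketch below is not part of the repository: the helper name `find_missing` is hypothetical, and the import names assume the usual mapping (PyYAML imports as `yaml`, scikit-learn as `sklearn`, opencv as `cv2`).

```python
import importlib.util

# Import names for the packages listed above (assumed mapping:
# PyYAML -> yaml, scikit-learn -> sklearn, opencv -> cv2).
REQUIRED_MODULES = [
    "torch", "torchvision", "yaml", "easydict", "tqdm",
    "sklearn", "efficientnet_pytorch", "pandas", "cv2",
]


def find_missing(modules):
    """Return the module names that cannot be found in this environment."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    print("missing:", ", ".join(missing) if missing else "none")
```

An empty result only means the modules are discoverable; it does not check versions.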

Datasets

Create a soft link to each dataset directory:

CUB dataset

ln -s /your-path-to/CUB-dataset data/cub

Car dataset

ln -s /your-path-to/Car-dataset data/car

Aircraft dataset

ln -s /your-path-to/Aircraft-dataset data/aircraft
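
On systems where `ln -s` is not convenient, the same links can be created from Python. This is a minimal sketch, not repository code: `link_dataset` is a hypothetical helper, and it assumes the `data/<name>` layout used by the commands above.

```python
import os
from pathlib import Path


def link_dataset(source, name, data_root="data"):
    """Create data_root/name as a symlink to source, like `ln -s source data/name`."""
    source = Path(source).resolve()
    if not source.is_dir():
        raise FileNotFoundError(f"dataset directory not found: {source}")
    target = Path(data_root) / name
    # Make sure the data/ directory exists before linking into it.
    target.parent.mkdir(parents=True, exist_ok=True)
    if target.is_symlink() or target.exists():
        raise FileExistsError(f"{target} already exists")
    os.symlink(source, target, target_is_directory=True)
    return target
```

For example, `link_dataset("/your-path-to/CUB-dataset", "cub")` mirrors the first command above. The helper refuses to overwrite an existing link rather than replacing it silently.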

Training

Training with ImageNet pre-trained weights

1. Baseline and Baseline+

To train a model on the CUB dataset using the ResNet-50 backbone:

python main.py # baseline

python main.py --midlevel # baseline+

To train a model on other datasets or with other network backbones, specify the following arguments:

--netname: name of the network architecture (four network families are supported: ResNet, DenseNet, InceptionV3, EfficientNet)

--dataset: dataset name

For example,

python main.py --netname resnet18 --dataset cub # using the Resnet-18 backbone on CUB dataset

python main.py --netname efficientnet-b0 --dataset cub # using the EfficientNet-b0 backbone on CUB dataset

python main.py --netname inceptionV3 --dataset aircraft # using the InceptionV3 backbone on Aircraft dataset

2. Training with mixing augmentation

Applying SnapMix in training (we used the hyperparameter values prob=1.0 and beta=5 for SnapMix in most of the experiments):

python main.py --mixmethod snapmix --beta 5 --netname resnet50 --dataset cub # baseline + SnapMix

python main.py --mixmethod snapmix --beta 5 --netname resnet50 --dataset cub --midlevel # baseline+ + SnapMix

Applying other augmentation methods (cutmix, cutout, and mixup are currently supported) in training:

python main.py --mixmethod cutmix --beta 3 --netname resnet50 --dataset cub # training with CutMix

python main.py --mixmethod mixup --prob 0.5 --netname resnet50 --dataset cub # training with MixUp
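
This README does not describe the `--beta` and `--prob` flags any further. In the published CutMix formulation, a mixing ratio is drawn from a Beta(beta, beta) distribution and a box covering roughly that share of the image is pasted from a second image. The sketch below assumes that meaning for `beta`, and assumes `prob` is the chance that a batch is mixed at all. It uses only the standard library, the function names are hypothetical, and it does not reproduce SnapMix's own label weighting.

```python
import math
import random


def should_mix(prob, rng=random):
    """Decide whether to apply mixing to the current batch (assumed meaning of prob)."""
    return rng.random() < prob


def sample_cut_box(height, width, beta, rng=random):
    """Sample a CutMix-style box; returns ((top, bottom, left, right), lam)."""
    lam = rng.betavariate(beta, beta)        # mixing ratio in [0, 1]
    cut_ratio = math.sqrt(1.0 - lam)         # side ratio so that box area is about 1 - lam
    cut_h, cut_w = int(height * cut_ratio), int(width * cut_ratio)
    cy, cx = rng.randrange(height), rng.randrange(width)  # random box centre
    top, bottom = max(cy - cut_h // 2, 0), min(cy + cut_h // 2, height)
    left, right = max(cx - cut_w // 2, 0), min(cx + cut_w // 2, width)
    # Recompute lam from the clipped box so it matches the pasted area exactly.
    lam = 1.0 - (bottom - top) * (right - left) / (height * width)
    return (top, bottom, left, right), lam
```

Under this reading, a symmetric Beta with a larger `beta` concentrates the ratio around 0.5, so the two images contribute more evenly.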

3. Results

ResNet architecture.

Backbone	Method	CUB	Car	Aircraft
Resnet-18	Baseline	82.35%	91.15%	87.80%
Resnet-18	Baseline + SnapMix	84.29%	93.12%	90.17%
Resnet-34	Baseline	84.98%	92.02%	89.92%
Resnet-34	Baseline + SnapMix	87.06%	93.95%	92.36%
Resnet-50	Baseline	85.49%	93.04%	91.07%
Resnet-50	Baseline + SnapMix	87.75%	94.30%	92.08%
Resnet-101	Baseline	85.62%	93.09%	91.59%
Resnet-101	Baseline + SnapMix	88.45%	94.44%	93.74%
Resnet-50	Baseline+	87.13%	93.80%	91.68%
Resnet-50	Baseline+ + SnapMix	88.70%	95.00%	93.24%
Resnet-101	Baseline+	87.81%	93.94%	91.85%
Resnet-101	Baseline+ + SnapMix	89.32%	94.84%	94.05%
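
To read the gains off the table at a glance, the differences between the "Baseline + SnapMix" and "Baseline" rows can be computed directly. The numbers below are copied from the table above; the dictionary layout and the function name `snapmix_gain` are illustrative only.

```python
# Accuracy (%) copied from the ResNet table above, ordered as (CUB, Car, Aircraft).
RESULTS = {
    "Resnet-18": {"Baseline": (82.35, 91.15, 87.80), "Baseline + SnapMix": (84.29, 93.12, 90.17)},
    "Resnet-34": {"Baseline": (84.98, 92.02, 89.92), "Baseline + SnapMix": (87.06, 93.95, 92.36)},
    "Resnet-50": {"Baseline": (85.49, 93.04, 91.07), "Baseline + SnapMix": (87.75, 94.30, 92.08)},
    "Resnet-101": {"Baseline": (85.62, 93.09, 91.59), "Baseline + SnapMix": (88.45, 94.44, 93.74)},
}


def snapmix_gain(backbone):
    """Return the gain in percentage points on (CUB, Car, Aircraft) for one backbone."""
    rows = RESULTS[backbone]
    pairs = zip(rows["Baseline"], rows["Baseline + SnapMix"])
    return tuple(round(mixed - base, 2) for base, mixed in pairs)


if __name__ == "__main__":
    for name in RESULTS:
        print(name, snapmix_gain(name))
```

For example, `snapmix_gain("Resnet-50")` gives `(2.26, 1.26, 1.01)`, that is, gains of 2.26, 1.26, and 1.01 points on CUB, Car, and Aircraft.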

InceptionV3 architecture.

Backbone	Method	CUB
InceptionV3	Baseline	82.22%
InceptionV3	Baseline + SnapMix	85.54%

DenseNet architecture.

Backbone	Method	CUB
DenseNet121	Baseline	84.23%
DenseNet121	Baseline + SnapMix	87.42%

Training from scratch

To train a model without using ImageNet pretrained weights:

python main.py --mixmethod snapmix --prob 0.5 --netname resnet18 --dataset cub --pretrained 0 # resnet-18 backbone

python main.py --mixmethod snapmix --prob 0.5 --netname resnet50 --dataset cub --pretrained 0 # resnet-50 backbone
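
When running many of these configurations, it can help to assemble the command line programmatically. The sketch below uses only the flags that appear in this README (`--netname`, `--dataset`, `--mixmethod`, `--beta`, `--prob`, `--midlevel`, `--pretrained`). The helper `build_command` is hypothetical and not part of the repository, and flags left as `None` are simply omitted.

```python
import shlex


def build_command(netname="resnet50", dataset="cub", mixmethod=None,
                  beta=None, prob=None, midlevel=False, pretrained=None):
    """Assemble a `python main.py ...` argument list from the flags shown in this README."""
    cmd = ["python", "main.py", "--netname", netname, "--dataset", dataset]
    if mixmethod is not None:
        cmd += ["--mixmethod", mixmethod]
    if beta is not None:
        cmd += ["--beta", str(beta)]
    if prob is not None:
        cmd += ["--prob", str(prob)]
    if midlevel:
        cmd.append("--midlevel")
    if pretrained is not None:
        cmd += ["--pretrained", str(int(pretrained))]
    return cmd


if __name__ == "__main__":
    # Print the two from-scratch runs shown above (flag order differs slightly).
    for net in ("resnet18", "resnet50"):
        print(shlex.join(build_command(net, mixmethod="snapmix", prob=0.5, pretrained=0)))
```

The returned list can be passed to `subprocess.run(cmd, check=True)` from the repository root.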

Results

Backbone	Method	CUB
Resnet-18	Baseline	64.98%
Resnet-18	Baseline + SnapMix	70.31%
Resnet-50	Baseline	66.92%
Resnet-50	Baseline + SnapMix	72.17%


Owner

DavidHuang
