A GridMixup augmentation, inspired by GridMask and CutMix

Last update: Dec 28, 2022

GridMixup

Easy install

pip install git+https://github.com/IlyaDobrynin/GridMixup.git

Overview

This simple augmentation is inspired by the GridMask and CutMix augmentations. Combining the two forms the proposed method.
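As a rough illustration of the idea, the sketch below pastes a regular grid of rectangular patches from a secondary image onto a main image and reports the share of pixels that still come from the main image. The function name `grid_mix`, the parameters `n_holes` and `hole_frac`, and the list-of-lists image format are assumptions made for this example; this is not the library's implementation, which also randomizes the crop and hole geometry.

```python
def grid_mix(main, secondary, n_holes=2, hole_frac=0.5):
    """Paste an n_holes x n_holes grid of patches from `secondary` onto `main`.

    Both images are equally sized 2D lists. Returns (mixed, lam), where lam is
    the fraction of pixels that still come from `main`.
    """
    height, width = len(main), len(main[0])
    cell_h, cell_w = height // n_holes, width // n_holes
    hole_h, hole_w = int(cell_h * hole_frac), int(cell_w * hole_frac)

    mixed = [row[:] for row in main]  # copy so the input image is not modified
    replaced = 0
    for gy in range(n_holes):
        for gx in range(n_holes):
            top, left = gy * cell_h, gx * cell_w
            # Fill the top-left part of each grid cell with the secondary image.
            for y in range(top, top + hole_h):
                for x in range(left, left + hole_w):
                    mixed[y][x] = secondary[y][x]
                    replaced += 1
    lam = 1 - replaced / (height * width)
    return mixed, lam


main = [[0] * 8 for _ in range(8)]       # toy "main" image, all zeros
secondary = [[1] * 8 for _ in range(8)]  # toy "secondary" image, all ones
mixed, lam = grid_mix(main, secondary, n_holes=2, hole_frac=0.5)
```

In this toy setup, `lam` plays the same role as the area share of the main image used to weight the loss.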

Example

To run the example notebooks, install the requirements:

pip install -r requirements.txt

Simple examples are here: demo and pipeline demo

TL;DR:

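from gridmix import GridMixupLoss

# Configure the augmentation; tuples define ranges to sample from.
gridmix_cls = GridMixupLoss(
    alpha=(0.4, 0.7),
    hole_aspect_ratio=1.,
    crop_area_ratio=(0.5, 1),
    crop_aspect_ratio=(0.5, 2),
    n_holes_x=(2, 6)
)

# `batch`, `model` and `criterion` come from your own training loop.
images, targets = batch['images'], batch['targets']
# Mix the images in the batch and build the matching mixed targets.
images_mixed, targets_mixed = gridmix_cls.get_sample(images=images, targets=targets)
preds = model(images_mixed)
loss = criterion(preds, targets_mixed)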

[Figures: example images before and after applying GridMixup]

The GridMixup loss is defined as:

lam * CrossEntropyLoss(preds, trues1) + (1 - lam) * CrossEntropyLoss(preds, trues2)

where:

lam - the share of the mixed image's area taken by the main image
(1 - lam) - the share of the mixed image's area taken by the secondary image
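The weighting can be checked by hand on a single sample. The following is a minimal sketch that assumes the model output is already a list of class probabilities (PyTorch's `CrossEntropyLoss` takes raw logits instead); the helper names `cross_entropy` and `mixed_loss` are hypothetical and not part of the library.

```python
import math


def cross_entropy(probs, target):
    """Negative log-likelihood of the target class for one sample."""
    return -math.log(probs[target])


def mixed_loss(probs, target_main, target_secondary, lam):
    """Weight the loss for each label by the area share of its image."""
    loss_main = cross_entropy(probs, target_main)
    loss_secondary = cross_entropy(probs, target_secondary)
    return lam * loss_main + (1 - lam) * loss_secondary


probs = [0.7, 0.2, 0.1]  # predicted class probabilities for one mixed image
loss = mixed_loss(probs, target_main=0, target_secondary=1, lam=0.75)
```

With `lam=1` the expression reduces to the ordinary cross-entropy against the main image's label, which is a quick sanity check for any implementation.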

Parameters

GridMixupLoss takes the following arguments:

alpha - defines the area of the main image in the mixed image. Can be a float or a Tuple[float, float].
- if float: lambda (lam in the loss formula) is sampled from the beta distribution np.random.beta(alpha, alpha);
- if Tuple[float, float]: lambda is sampled from the uniform distribution np.random.uniform(alpha[0], alpha[1]).
n_holes_x - number of holes in the crop along the X axis.
hole_aspect_ratio - aspect ratio of the holes.
crop_area_ratio - defines the area of the secondary image in the mixed image.
crop_aspect_ratio - aspect ratio of the crop.
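The two ways of interpreting alpha can be sketched as follows. To keep the example dependency-free, it swaps in the standard library's `random.betavariate` and `random.uniform` for the NumPy calls named above; the function name `sample_lam` is an assumption for illustration, not the library's API.

```python
import random


def sample_lam(alpha):
    """Draw the main-image area share from a float or a (low, high) pair."""
    if isinstance(alpha, (tuple, list)):
        low, high = alpha
        # Tuple case: uniform draw between the two bounds.
        return random.uniform(low, high)
    # Float case: symmetric Beta(alpha, alpha) draw in [0, 1].
    return random.betavariate(alpha, alpha)


random.seed(0)  # fixed seed so repeated runs give the same draws
lam_uniform = sample_lam((0.4, 0.7))
lam_beta = sample_lam(0.4)
```

A tuple keeps lambda inside a fixed range, while a float below 1 makes the beta distribution favor values near 0 or 1, so one image tends to dominate the mix.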

Owner

IlyaDo