Official implementation of the paper "AAVAE: Augmentation-AugmentedVariational Autoencoders"

Last update: Dec 12, 2022

Related tags

Overview

AAVAE

Official implementation of the paper "AAVAE: Augmentation-AugmentedVariational Autoencoders"

Abstract

Recent methods for self-supervised learning can be grouped into two paradigms: contrastive and non-contrastive approaches. Their success can largely be attributed to data augmentation pipelines which generate multiple views of a single input that preserve the underlying semantics. In this work, we introduce augmentation-augmented variational autoencoders (AAVAE), a third approach to self-supervised learning based on autoencoding. We derive AAVAE starting from the conventional variational autoencoder (VAE), by replacing the KL divergence regularization, which is agnostic to the input domain, with data augmentations that explicitly encourage the internal representations to encode domain-specific invariances and equivariances. We empirically evaluate the proposed AAVAE on image classification, similar to how recent contrastive and non-contrastive learning algorithms have been evaluated. Our experiments confirm the effectiveness of data augmentation as a replacement for KL divergence regularization. The AAVAE outperforms the VAE by 30% on CIFAR-10 and 40% on STL-10. The results for AAVAE are largely comparable to the state-of-the-art for self-supervised learning.

Training

To train the AAVAE model

Create a python virtual environment.
python setup.py install.
Train using python src/vae.py --denoising.

To reproduce the results from the paper on CIFAR-10:

python src/vae.py \
    --gpus 1 \
    --max_epochs 3200 \
    --batch_size 256 \
    --warmup_epochs 10 \
    --val_samples 16 \
    --weight_decay 0 \
    --logscale 0 \
    --kl_coeff 0 \
    --learning_rate 2.5e-4

To evaluate the pretrained encoder

python src/linear_eval.py --ckpt_path "path\to\saved\file.ckpt"

Saved checkpoints

Model	Dataset	Checkpoint	Downstream acc.
AAVAE	CIFAR-10	checkpoint	87.14
AAVAE	STL-10	checkpoint	84.72

Official implementation of the paper "AAVAE: Augmentation-AugmentedVariational Autoencoders"

Related tags

Overview

AAVAE

Abstract

Training

Saved checkpoints

Owner

Grid AI Labs

Code for ICMI2020 and ICMI2021 papers: "Studying Person-Specific Pointing and Gaze Behavior for Multimodal Referencing of Outside Objects from a Moving Vehicle" and "ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving Vehicle"

PyTorch Implementation of Backbone of PicoDet

A general framework for deep learning experiments under PyTorch based on pytorch-lightning

Transport Mode detection - can detect the mode of transport with the help of features such as acceeration,jerk etc

🏆 The 1st Place Submission to AICity Challenge 2021 Natural Language-Based Vehicle Retrieval Track (Alibaba-UTS submission)

iPOKE: Poking a Still Image for Controlled Stochastic Video Synthesis

Torch-based tool for quantizing high-dimensional vectors using additive codebooks

Clinica is a software platform for clinical research studies involving patients with neurological and psychiatric diseases and the acquisition of multimodal data

Very deep VAEs in JAX/Flax

TreeSubstitutionCipher - Encryption system based on trees and substitution

A pytorch implementation of Reading Wikipedia to Answer Open-Domain Questions.

Deep Learning as a Cloud API Service.

Torch-mutable-modules - Use in-place and assignment operations on PyTorch module parameters with support for autograd

CSAC - Collaborative Semantic Aggregation and Calibration for Separated Domain Generalization

This repository contains PyTorch models for SpecTr (Spectral Transformer).

Code for the paper: Learning Adversarially Robust Representations via Worst-Case Mutual Information Maximization (https://arxiv.org/abs/2002.11798)

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".

This is an example implementation of the paper "Cross Domain Robot Imitation with Invariant Representation".

DivNoising is an unsupervised denoising method to generate diverse denoised samples for any noisy input image. This repository contains the code to reproduce the results reported in the paper https://openreview.net/pdf?id=agHLCOBM5jP

FADNet++: Real-Time and Accurate Disparity Estimation with Configurable Networks