A simple, unofficial implementation of MAE using pytorch-lightning

Last update: Dec 03, 2022

Related tags

Deep Learning mae-pytorch

Overview

Masked Autoencoders in PyTorch

A simple, unofficial implementation of MAE (Masked Autoencoders are Scalable Vision Learners) using pytorch-lightning.

Currently implements training on CUB and StanfordCars, but is easily extensible to any other image dataset.

Setup

.env">

# Clone the repository
git clone https://github.com/catalys1/mae-pytorch.git
cd mae-pytorch

# Install required libraries (inside a virtual environment preferably)
pip install -r requirements.txt

# Set up .env for path to data
echo "DATADIR=/path/to/data" > .env

Usage

MAE training

Training options are provided through configuration files, handled by LightningCLI. See configs/ for examples.

Train an MAE model on the CUB dataset:

python train.py fit --config=configs/mae.yaml --config=configs/data/cub_mae.yaml

Using multiple GPUs:

python train.py fit --config=configs/mae.yaml --config=configs/data/cub_mae.yaml --config=configs/multigpu.yaml

Fine-tuning

Not yet implemented.

Implementation

The default model uses ViT-Base for the encoder, and a small ViT (depth=4, width=192) for the decoder. This is smaller than the model used in the paper.

Dependencies

Configuration and training is handled completely by pytorch-lightning.
The MAE model uses the VisionTransformer from timm.
Interface to FGVC datasets through fgvcdata.
Configurable environment variables through python-dotenv.

Results

Image reconstructions of CUB validation set images after training with the following command:

python train.py fit --config=configs/mae.yaml --config=configs/data/cub_mae.yaml --config=configs/multigpu.yaml

A simple, unofficial implementation of MAE using pytorch-lightning

Related tags

Overview

Masked Autoencoders in PyTorch

Setup

Usage

MAE training

Fine-tuning

Implementation

Dependencies

Results

Owner

Connor Anderson

Encoding Causal Macrovariables

Theory-inspired Parameter Control Benchmarks for Dynamic Algorithm Configuration

An implementation of shampoo

Live training loss plot in Jupyter Notebook for Keras, PyTorch and others

DFFNet: An IoT-perceptive Dual Feature Fusion Network for General Real-time Semantic Segmentation

Runtime type annotations for the shape, dtype etc. of PyTorch Tensors.

Implement face detection, and age and gender classification, and emotion classification.

RADIal is available now! Check the download section

Torch-mutable-modules - Use in-place and assignment operations on PyTorch module parameters with support for autograd

constructing maps of intellectual influence from publication data

MTA:SA Server Configer.

Why Are You Weird? Infusing Interpretability in Isolation Forest for Anomaly Detection

Simulations for Turring patterns on an apically expanding domain. T

Reinforcement Learning for Portfolio Management

这是一个yolox-keras的源码，可以用于训练自己的模型。

Simulation code and tutorial for BBHnet training data

Super-BPD: Super Boundary-to-Pixel Direction for Fast Image Segmentation (CVPR 2020)

sktime companion package for deep learning based on TensorFlow

[PyTorch] Official implementation of CVPR2021 paper "PointDSC: Robust Point Cloud Registration using Deep Spatial Consistency". https://arxiv.org/abs/2103.05465

A script that trains a model to recognize handwritten digits using the MNIST data set.