MaskGIT-pytorch

Pytorch implementation of MaskGIT: Masked Generative Image Transformer (https://arxiv.org/pdf/2202.04200.pdf)

Note: this is work in progress

MaskGIT is an extension to the VQGAN paper which improves the second stage transformer part (and leaves the first stage untouched). It switches the unidirectional transformer for a bidirectional transformer. The (second stage) training is pretty similar to BERT by randomly masking out tokens and trying to predict these using the bidirectional transformer (the original work used a GPT architecture randomly replaced tokens by other tokens). Different from BERT, the percentage for the masking is not fixed and uniformly distributed between 0 and 1 for each batch. Furhtermore, a new inference algorithm is suggested in which we start off by a completely masked-out image and then iteratively sample vectors where the model has a high confidence.

If you are only interested in the part of the code that comes from this paper check out transformer.py.

Run the code

The code is ready for training both the VQGAN and the Bidirectional Transformer and can also be used for inference

python training_vqgan.py

python training_transformer.py

(Make sure to edit the path for the dataset etc.)

TODO

Implement the gamma functions
Implement functions for image editing tasks: inpainting, extrapolation, image manipulation
Tune hyperparameters
(Provide visual results)

Pytorch implementation of MaskGIT: Masked Generative Image Transformer

Related tags

Overview

MaskGIT-pytorch

Note: this is work in progress

Run the code

TODO

Owner

Dominic Rampas

This repository contains several jupyter notebooks to help users learn to use neon, our deep learning framework

This tool converts a Nondeterministic Finite Automata (NFA) into a Deterministic Finite Automata (DFA)

It's A ML based Web Site build with python and Django to find the breed of the dog

MASA-SR: Matching Acceleration and Spatial Adaptation for Reference-Based Image Super-Resolution (CVPR2021)

A PyTorch-based library for fast prototyping and sharing of deep neural network models.

Official repository for: Continuous Control With Ensemble DeepDeterministic Policy Gradients

🏆 The 1st Place Submission to AICity Challenge 2021 Natural Language-Based Vehicle Retrieval Track (Alibaba-UTS submission)

Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021

A Partition Filter Network for Joint Entity and Relation Extraction EMNLP 2021

An evaluation toolkit for voice conversion models.

Neighborhood Reconstructing Autoencoders

Only a Matter of Style: Age Transformation Using a Style-Based Regression Model

CS506-Spring2022 - Code and Slides for Boston University CS 506

Trading environnement for RL agents, backtesting and training.

N-Omniglot is a large neuromorphic few-shot learning dataset

An automated algorithm to extract the linear blend skinning (LBS) from a set of example poses

Reinforcement Learning with Q-Learning Algorithm on gym's frozen lake environment implemented in python

🚀 PyTorch Implementation of "Progressive Distillation for Fast Sampling of Diffusion Models(v-diffusion)"

Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)

Python-based Informatics Kit for Analysing Chemical Units