Third party Pytorch implement of Image Processing Transformer (Pre-Trained Image Processing Transformer arXiv:2012.00364v2)

Last update: Jan 01, 2023

Overview

ImageProcessingTransformer

Third party Pytorch implement of Image Processing Transformer (Pre-Trained Image Processing Transformer arXiv:2012.00364v2)

The latest version contains some important modifications according to the official mindspore implementation. It makes convergecy a lot faster. Please make sure you update to the latest version.

only contain model definition file and train/test file. Dataloader file is not yet released. You could implement your own dataloader. It may be released in the next version.

To pretrain on random task

python main.py --seed 0 \
--lr 5e-5 \
--save-path "./ckpt" \
--epochs 300 \
--data path-to-data \
--batch-size 256

To finetune on a specific task

python main.py --seed 0 \
--lr 2e-5 \
--save-path "./ckpt" \
--epochs 30 \
--reset-epoch \
--data path-to-data \
--batch-size 256 \
--resume path-to-pretrain-model \
--task "dehaze"

To eval on a specific task

python main.py --seed 0 \
--eval-data path-to-val-data \
--batch-size 256 \
--eval \
--resume path-to-model \
--task "dehaze"

Third party Pytorch implement of Image Processing Transformer (Pre-Trained Image Processing Transformer arXiv:2012.00364v2)

Related tags

Overview

ImageProcessingTransformer

Owner

Working demo of the Multi-class and Anomaly classification model using the CLIP feature space

Deep learning-based approach to discovering Granger causality networks in multivariate time series

Project page for End-to-end Recovery of Human Shape and Pose

Annotated, understandable, and visually interpretable PyTorch implementations of: VAE, BIRVAE, NSGAN, MMGAN, WGAN, WGANGP, LSGAN, DRAGAN, BEGAN, RaGAN, InfoGAN, fGAN, FisherGAN

[CVPR 2021] MiVOS - Mask Propagation module. Reproduced STM (and better) with training code :star2:. Semi-supervised video object segmentation evaluation.

AdamW optimizer for bfloat16 models in pytorch.

Doods2 - API for detecting objects in images and video streams using Tensorflow

💃 VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena

PFFDTD is an open-source FDTD simulator for 3D room acoustics

Official Implementation of SWAD (NeurIPS 2021)

CTC segmentation python package

Progressive Domain Adaptation for Object Detection

An example of Scatterbrain implementation (combining local attention and Performer)

Airborne Optical Sectioning (AOS) is a wide synthetic-aperture imaging technique

code associated with ACL 2021 DExperts paper

The Unsupervised Reinforcement Learning Benchmark (URLB)

Scene-Text-Detection-and-Recognition (Pytorch)

CRF-RNN for Semantic Image Segmentation - PyTorch version

Cmsc11 arcade - Final Project for CMSC11

Extremely simple and fast extreme multi-class and multi-label classifiers.