Imaginaire - NVIDIA's Deep Imagination Team's PyTorch Library

Last update: Dec 29, 2022

Related tags

Overview

Imaginaire

Docs | License | Installation | Model Zoo

Imaginaire is a pytorch library that contains optimized implementation of several image and video synthesis methods developed at NVIDIA.

License

Imaginaire is released under NVIDIA Software license. For commercial use, please consult NVIDIA Research Inquiries.

What's inside?

We have a tutorial for each model. Click on the model name, and your browser should take you to the tutorial page for the project.

Supervised Image-to-Image Translation

Algorithm Name	Feature	Publication
pix2pixHD	Learn a mapping that converts a semantic image to a high-resolution photorealistic image.	Wang et. al. CVPR 2018
SPADE	Improve pix2pixHD on handling diverse input labels and delivering better output quality.	Park et. al. CVPR 2019

Unsupervised Image-to-Image Translation

Algorithm Name	Feature	Publication
UNIT	Learn a one-to-one mapping between two visual domains.	Liu et. al. NeurIPS 2017
MUNIT	Learn a many-to-many mapping between two visual domains.	Huang et. al. ECCV 2018
FUNIT	Learn a style-guided image translation model that can generate translations in unseen domains.	Liu et. al. ICCV 2019
COCO-FUNIT	Improve FUNIT with a content-conditioned style encoding scheme for style code computation.	Saito et. al. ECCV 2020

Video-to-video Translation

Algorithm Name	Feature	Publication
vid2vid	Learn a mapping that converts a semantic video to a photorealistic video.	Wang et. al. NeurIPS 2018
fs-vid2vid	Learn a subject-agnostic mapping that converts a semantic video and an example image to a photoreslitic video.	Wang et. al. NeurIPS 2019

World-to-world Translation

Algorithm Name	Feature	Publication
wc-vid2vid	Improve vid2vid on view consistency and long-term consistency.	Mallya et. al. ECCV 2020
GANcraft	Convert semantic block worlds to realistic-looking worlds.	Hao et. al. ICCV 2021

Imaginaire - NVIDIA's Deep Imagination Team's PyTorch Library

Related tags

Overview

Imaginaire

Docs | License | Installation | Model Zoo

License

What's inside?

Supervised Image-to-Image Translation

Unsupervised Image-to-Image Translation

Video-to-video Translation

World-to-world Translation

Owner

NVIDIA Research Projects

A Multi-modal Model Chinese Spell Checker Released on ACL2021.

CURL: Contrastive Unsupervised Representations for Reinforcement Learning

PyTorch version implementation of DORN

Codebase for Inducing Causal Structure for Interpretable Neural Networks

RRL: Resnet as representation for Reinforcement Learning

Official PyTorch implementation of SyntaSpeech (IJCAI 2022)

GraphGT: Machine Learning Datasets for Graph Generation and Transformation

Explanatory Learning: Beyond Empiricism in Neural Networks

PyTorch implementation of NIPS 2017 paper Dynamic Routing Between Capsules

Official PyTorch implementation of Segmenter: Transformer for Semantic Segmentation

Implementation of the paper titled "Using Sampling to Estimate and Improve Performance of Automated Scoring Systems with Guarantees"

A framework for Quantification written in Python

A deep neural networks for images using CNN algorithm.

Final project code: Implementing MAE with downscaled encoders and datasets, for ESE546 FA21 at University of Pennsylvania

Mesh Graphormer is a new transformer-based method for human pose and mesh reconsruction from an input image

Collection of generative models, e.g. GAN, VAE in Pytorch and Tensorflow.

A general framework for deep learning experiments under PyTorch based on pytorch-lightning

Code for our CVPR2021 paper coordinate attention

PyTorch implementation of popular datasets and models in remote sensing

MAME is a multi-purpose emulation framework.