Imaginaire - NVIDIA's Deep Imagination Team's PyTorch Library

Last update: Dec 29, 2022

Related tags

Overview

Imaginaire

Docs | License | Installation | Model Zoo

Imaginaire is a pytorch library that contains optimized implementation of several image and video synthesis methods developed at NVIDIA.

License

Imaginaire is released under NVIDIA Software license. For commercial use, please consult NVIDIA Research Inquiries.

What's inside?

We have a tutorial for each model. Click on the model name, and your browser should take you to the tutorial page for the project.

Supervised Image-to-Image Translation

Algorithm Name	Feature	Publication
pix2pixHD	Learn a mapping that converts a semantic image to a high-resolution photorealistic image.	Wang et. al. CVPR 2018
SPADE	Improve pix2pixHD on handling diverse input labels and delivering better output quality.	Park et. al. CVPR 2019

Unsupervised Image-to-Image Translation

Algorithm Name	Feature	Publication
UNIT	Learn a one-to-one mapping between two visual domains.	Liu et. al. NeurIPS 2017
MUNIT	Learn a many-to-many mapping between two visual domains.	Huang et. al. ECCV 2018
FUNIT	Learn a style-guided image translation model that can generate translations in unseen domains.	Liu et. al. ICCV 2019
COCO-FUNIT	Improve FUNIT with a content-conditioned style encoding scheme for style code computation.	Saito et. al. ECCV 2020

Video-to-video Translation

Algorithm Name	Feature	Publication
vid2vid	Learn a mapping that converts a semantic video to a photorealistic video.	Wang et. al. NeurIPS 2018
fs-vid2vid	Learn a subject-agnostic mapping that converts a semantic video and an example image to a photoreslitic video.	Wang et. al. NeurIPS 2019

World-to-world Translation

Algorithm Name	Feature	Publication
wc-vid2vid	Improve vid2vid on view consistency and long-term consistency.	Mallya et. al. ECCV 2020
GANcraft	Convert semantic block worlds to realistic-looking worlds.	Hao et. al. ICCV 2021

Imaginaire - NVIDIA's Deep Imagination Team's PyTorch Library

Related tags

Overview

Imaginaire

Docs | License | Installation | Model Zoo

License

What's inside?

Supervised Image-to-Image Translation

Unsupervised Image-to-Image Translation

Video-to-video Translation

World-to-world Translation

Owner

NVIDIA Research Projects

Official implementation of the NRNS paper: No RL, No Simulation: Learning to Navigate without Navigating

Official repository for "PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation"

Repository containing the PhD Thesis "Formal Verification of Deep Reinforcement Learning Agents"

Image-Stitching - Panorama composition using SIFT Features and a custom implementaion of RANSAC algorithm

GraphGT: Machine Learning Datasets for Graph Generation and Transformation

Random-Afg - Afghanistan Random Old Idz Cloner Tools

Adversarial Attacks are Reversible via Natural Supervision

Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)

Toolbox of models, callbacks, and datasets for AI/ML researchers.

Continuous Time LiDAR odometry

LAVT: Language-Aware Vision Transformer for Referring Image Segmentation

we propose a novel deep network, named feature aggregation and refinement network (FARNet), for the automatic detection of anatomical landmarks.

This is a library for training and applying sparse fine-tunings with torch and transformers.

MVGCN: a novel multi-view graph convolutional network (MVGCN) framework for link prediction in biomedical bipartite networks.

BlueFog Tutorials

Structural Constraints on Information Content in Human Brain States

HDMapNet: A Local Semantic Map Learning and Evaluation Framework

Repository for the COLING 2020 paper "Explainable Automated Fact-Checking: A Survey."

CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP

Various operations like path tracking, counting, etc by using yolov5