Aligning Latent and Image Spaces to Connect the Unconnectable

Last update: Jan 03, 2023

About

This repo contains the official implementation of the paper "Aligning Latent and Image Spaces to Connect the Unconnectable". It is a GAN model that can generate infinite images of diverse and complex scenes.

[Project page] [Paper]

Installation

To install, run the following commands:

conda env create --file environment.yml --prefix ./env
conda activate ./env

Note: the tensorboard requirement is crucial, because otherwise upfirdn2d will not compile for some magical reason.
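
As a quick sanity check after activating the environment, a minimal sketch like the one below can confirm that tensorboard is visible to Python before the first run triggers compilation of upfirdn2d. The helper name missing_modules is hypothetical and not part of this repo.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be found in the current environment."""
    # find_spec returns None when a top-level module is not installed.
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Only tensorboard is checked here because it is the requirement the note above singles out.
    missing = missing_modules(["tensorboard"])
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("tensorboard is available in this environment.")
```

If the check reports a missing module, recreating the environment from environment.yml is probably the simplest fix.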

Training

To train the model, navigate to the project directory and run:

python infra/launch_local.py hydra.run.dir=. +experiment_name=my_experiment_name +dataset=dataset_name num_gpus=4

Here, dataset_name is the name of the dataset archive inside the data/ directory, without the .zip extension (you can override the paths in configs/main.yml). Make sure that data/dataset_name.zip exists and that it contains a plain directory of images. See the StyleGAN2-ADA repo for additional data format details.

The training command creates an experiment inside the experiments/ directory and copies the project files into it. This isolates the code that produced the model.
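
A small pre-flight helper can catch a missing dataset archive before the launcher starts. The sketch below only assembles the same command line shown above and checks that the archive exists; the function build_launch_command and its data_dir parameter are hypothetical, and the default of "data" assumes the paths in configs/main.yml have not been overridden.

```python
from pathlib import Path


def build_launch_command(experiment_name, dataset_name, num_gpus=4, data_dir="data"):
    """Check that <data_dir>/<dataset_name>.zip exists and return the training command as a list."""
    zip_path = Path(data_dir) / f"{dataset_name}.zip"
    if not zip_path.is_file():
        raise FileNotFoundError(f"Expected dataset archive at {zip_path}")
    # Mirrors the command from this section; no extra flags are added.
    return [
        "python",
        "infra/launch_local.py",
        "hydra.run.dir=.",
        f"+experiment_name={experiment_name}",
        f"+dataset={dataset_name}",
        f"num_gpus={num_gpus}",
    ]

# Example (run from the project directory):
#   import subprocess
#   subprocess.run(build_launch_command("my_experiment_name", "dataset_name"), check=True)
```

Returning the command as a list rather than a string avoids shell quoting issues when it is passed to subprocess.run.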

Inference

An inference example can be found in notebooks/generate.ipynb.

Data format

We use the same data format as the original StyleGAN2-ADA repo: a zip archive of images. All datasets are assumed to be located in a single directory, specified in configs/main.yml. Put your datasets as zip archives into the data/ directory.
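
For a folder of images that is not yet archived, a sketch along these lines could produce the zip. The function pack_images and the extension list are assumptions made for illustration. The sketch does not resize or validate images, so any further requirements described in the StyleGAN2-ADA repo still apply.

```python
import zipfile
from pathlib import Path

# Assumed set of image extensions; adjust to match your dataset.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def pack_images(image_dir, zip_path):
    """Write every image file directly inside image_dir into a flat zip archive; return the count."""
    image_dir = Path(image_dir)
    zip_path = Path(zip_path)
    images = sorted(
        p for p in image_dir.iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_EXTENSIONS
    )
    zip_path.parent.mkdir(parents=True, exist_ok=True)
    # Images are already compressed, so they are stored without extra compression.
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_STORED) as archive:
        for image in images:
            archive.write(image, arcname=image.name)
    return len(images)

# Example: pack_images("path/to/images", "data/dataset_name.zip")
```

Files in subdirectories are ignored on purpose, so the archive stays a plain, flat collection of images.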

Pretrained checkpoints

We provide checkpoints for the following datasets:

LHQ 1024x1024 with FID = 7.8. Note: this checkpoint has a patch size of 1024x512, i.e. the image is generated in just 2 halves.
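
The "2 halves" figure follows from dividing the image size by the patch size. The sketch below is plain arithmetic, not code from this repo: num_patches is a hypothetical helper, and the order of the two patch dimensions is an assumption (for a square image the count is the same either way).

```python
def num_patches(image_size, patch_size):
    """Count the non-overlapping patches needed to tile an image of the given size."""
    (image_a, image_b), (patch_a, patch_b) = image_size, patch_size
    if image_a % patch_a or image_b % patch_b:
        raise ValueError("Patch size must evenly divide the image size.")
    return (image_a // patch_a) * (image_b // patch_b)


# 1024x1024 image with 1024x512 patches: 1 * 2 = 2 patches.
print(num_patches((1024, 1024), (1024, 512)))  # 2
```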

License

The project is based on the StyleGAN2-ADA repo developed by NVIDIA. I am not a lawyer, but I suppose that the NVIDIA license therefore applies to this project.


Owner

Ivan Skorokhodov
