Official implementation of the paper Image Generators with Conditionally-Independent Pixel Synthesis https://arxiv.org/abs/2011.13775

Last update: Dec 21, 2022

Overview

CIPS -- Official Pytorch Implementation

of the paper Image Generators with Conditionally-Independent Pixel Synthesis

Requirements

pip install -r requirements.txt

Usage

First create lmdb datasets:

python prepare_data.py images --out LMDB_PATH --n_worker N_WORKER --size SIZE1,SIZE2,SIZE3,... DATASET_PATH

This will convert images to jpeg and pre-resizes it.

To train on FFHQ-256 or churches please run:

python3 -m torch.distributed.launch --nproc_per_node=8 --master_port=1234 train.py --n_sample=8 --batch=4 --fid_batch=8 --Generator=CIPSskip --output_dir=skip-[ffhq/churches] --img2dis --num_workers=16 DATASET_PATH

To train on patches add --crop=PATCH_SIZE. PATCH_SIZE has to be a power of 2.

Pretrained Checkpoints

Generate samples

To play with the models please download checkpoints and check out a notebook.ipynb

Progressive training

We also tried to train progressively on FFHQ starting from 256×256 initialization and got FID 10.07. We will update the paper with the training details soon. Checkpoint name is ffhq1024.pt. Samples are below.

Citation

If you found our work useful, please don't forget to cite

@article{anokhin2020image,
  title={Image Generators with Conditionally-Independent Pixel Synthesis},
  author={Anokhin, Ivan and Demochkin, Kirill and Khakhulin, Taras and Sterkin, Gleb and Lempitsky, Victor and Korzhenkov, Denis},
  journal={arXiv preprint arXiv:2011.13775},
  year={2020}
}

The code is heavely based on the styleganv2 pytorch implementation

Nvidia-licensed CUDA kernels (fused_bias_act_kernel.cu, upfirdn2d_kernel.cu) is for non-commercial use only.

Official implementation of the paper Image Generators with Conditionally-Independent Pixel Synthesis https://arxiv.org/abs/2011.13775

Related tags

Overview

CIPS -- Official Pytorch Implementation

Requirements

Usage

Pretrained Checkpoints

Generate samples

Progressive training

Citation

Owner

Multimodal Lab @ Samsung AI Center Moscow

SymPy-powered, Wolfram|Alpha-like answer engine totally in your browser, without backend computation

Official Code Release for "CLIP-Adapter: Better Vision-Language Models with Feature Adapters"

Multi-Task Temporal Shift Attention Networks for On-Device Contactless Vitals Measurement (NeurIPS 2020)

From Fidelity to Perceptual Quality: A Semi-Supervised Approach for Low-Light Image Enhancement (CVPR'2020)

A PyTorch based deep learning library for drug pair scoring.

GAN encoders in PyTorch that could match PGGAN, StyleGAN v1/v2, and BigGAN. Code also integrates the implementation of these GANs.

Few-shot Neural Architecture Search

Time Series Forecasting with Temporal Fusion Transformer in Pytorch

The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

Source code for the paper: Variance-Aware Machine Translation Test Sets (NeurIPS 2021 Datasets and Benchmarks Track)

TensorRT examples (Jetson, Python/C++)(object detection)

Python-kafka-reset-consumergroup-offset-example - Python Kafka reset consumergroup offset example

This repository contains the code for the binaural-detection model used in the publication arXiv:2111.04637

Generate images from texts. In Russian

Bounding Wasserstein distance with couplings

PyTorch implementation of "Contrast to Divide: self-supervised pre-training for learning with noisy labels"

An official implementation of MobileStyleGAN in PyTorch

In this project we combine techniques from neural voice cloning and musical instrument synthesis to achieve good results from as little as 16 seconds of target data.

Flickr-Faces-HQ (FFHQ) is a high-quality image dataset of human faces, originally created as a benchmark for generative adversarial networks (GAN)

AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation