The implementation of 'Image synthesis via semantic composition'.

Last update: Jan 06, 2023

Image synthesis via semantic composition [Project Page]

by Yi Wang, Lu Qi, Ying-Cong Chen, Xiangyu Zhang, Jiaya Jia.

Introduction

This repository provides the implementation of our semantic image synthesis method from the ICCV 2021 paper, 'Image synthesis via semantic composition'.

Our framework

Usage

git clone https://github.com/dvlab-research/SCGAN.git
cd SCGAN/code

To use this code, please install PyTorch 1.0 and Python 3+. Other dependencies can be installed with:

pip install -r requirements.txt

Dataset Preparation

Please refer to SPADE for details on dataset preparation.

Testing

Download the pretrained models, then put the folder containing the model weights in ./checkpoints.
Produce images with the pretrained models:

python test.py --gpu_ids 0,1,2,3 --dataset_mode [dataset] --config config/scgan_[dataset]_test.yml --fid --gt [gt_path] --visual_n 1

For example,

python test.py --gpu_ids 0,1,2,3 --dataset_mode celeba --config config/scgan_celeba-test.yml --fid --gt /data/datasets/celeba --visual_n 1

Visual results are stored at ./results/scgan_[dataset]/ by default.

Pretrained Models (to be updated)

Dataset	Download link
CelebAMask-HQ	Baidu Disk (Code: face)

Training

Use train.sh to train new models. Alternatively, you can specify training options in config/[config_file].yml.

Key operators

Our proposed dynamic computation units (spatial conditional convolution and normalization) are extended from conditionally parameterized convolutions [1]. We generalize the scalar condition into a spatial one and also apply these techniques to normalization.
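
To illustrate the idea of generalizing a scalar condition into a spatial one, here is a minimal PyTorch sketch of a spatially conditional convolution. It is not the code from this repository: the class name SpatialCondConv2d, the number of experts, the 1x1 routing convolution, and the sigmoid routing are all assumptions made for illustration. It relies on the fact that convolution is linear, so mixing K expert kernels with per-pixel weights gives the same result as mixing the K expert outputs per pixel.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class SpatialCondConv2d(nn.Module):
    """Illustrative spatially conditional convolution (hypothetical, not the repository's layer)."""

    def __init__(self, in_ch, out_ch, cond_ch, num_experts=4, kernel_size=3):
        super().__init__()
        self.num_experts = num_experts
        self.out_ch = out_ch
        # All K expert kernels are evaluated by one convolution whose
        # output holds K * out_ch channels.
        self.experts = nn.Conv2d(in_ch, num_experts * out_ch, kernel_size,
                                 padding=kernel_size // 2)
        # Routing branch (assumed design): predicts K mixing weights per pixel
        # from the condition map, e.g. a one-hot semantic layout.
        self.routing = nn.Conv2d(cond_ch, num_experts, kernel_size=1)

    def forward(self, x, cond):
        n, _, h, w = x.shape
        # Resize the condition map to the feature resolution.
        cond = F.interpolate(cond, size=(h, w), mode="nearest")
        r = torch.sigmoid(self.routing(cond))                       # (N, K, H, W)
        y = self.experts(x).view(n, self.num_experts, self.out_ch, h, w)
        # Per-pixel weighted sum over the K expert outputs.
        return (r.unsqueeze(2) * y).sum(dim=1)                      # (N, out_ch, H, W)


if __name__ == "__main__":
    layer = SpatialCondConv2d(in_ch=8, out_ch=16, cond_ch=5)
    features = torch.randn(2, 8, 32, 32)
    layout = torch.randn(2, 5, 64, 64)
    print(layer(features, layout).shape)  # torch.Size([2, 16, 32, 32])
```

In CondConv [1] the routing weights are one scalar per expert for the whole input; in this sketch they form a map with one value per expert and per pixel, so different regions of the layout can use different effective kernels. The same routing idea could be applied to the scale and shift parameters of a normalization layer, which is not shown here.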

Citation

If our research is useful for you, please consider citing:

@inproceedings{wang2021image,
  title={Image Synthesis via Semantic Composition},
  author={Wang, Yi and Qi, Lu and Chen, Ying-Cong and Zhang, Xiangyu and Jia, Jiaya},
  booktitle={ICCV},
  year={2021}
}

Acknowledgements

This code is built upon SPADE, Imaginaire, and PyTorch-FID.

Reference

[1] Brandon Yang, Gabriel Bender, Quoc V. Le, and Jiquan Ngiam. CondConv: Conditionally parameterized convolutions for efficient inference. In NeurIPS, 2019.

Contact

Please send email to [email protected].
