Learning What and Where to Draw

Last update: Nov 18, 2022

###Learning What and Where to Draw

Scott Reed, Zeynep Akata, Santosh Mohan, Samuel Tenka, Bernt Schiele, Honglak Lee

This is the code for our NIPS 2016 paper on text- and location-controllable image synthesis using conditional GANs. Much of the code is adapted from reedscot/icml2016 and dcgan.torch.

####Setup Instructions

You will need to install Torch, CuDNN, stnbhwd and the display package.
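
Before training, it can help to confirm that the basic command-line tools are reachable. The sketch below assumes a standard Torch install exposes the `th` interpreter and the `luarocks` package manager on your `PATH` (this is not stated above); the `check_tool` helper is hypothetical and not part of this repository. It does not verify CuDNN, stnbhwd or the display package.

```shell
#!/bin/sh
# Hypothetical helper (not part of this repo): report whether a command is on PATH.
check_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "found: $1"
    else
        echo "missing: $1"
        return 1
    fi
}

# Assumption: a standard Torch install provides `th` and `luarocks`.
for tool in th luarocks; do
    # `|| true` keeps going so every tool is reported, even after a miss.
    check_tool "$tool" || true
done
```

Any line starting with `missing:` points to a tool that still needs to be installed or added to `PATH`.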

####How to train a text to image model:

1. Download the data including captions, location annotations and pretrained models.
2. Download the birds and humans image data.
3. Modify the CONFIG file to point to your data.
4. Run one of the training scripts, e.g. ./scripts/train_cub_keypoints.sh
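
The last two steps can be wrapped in a small guard so that training does not start with a missing configuration. This is a minimal sketch, assuming the CONFIG file sits in the repository root and the command is launched from there; the `run_training` function is hypothetical and not part of this repository.

```shell
#!/bin/sh
# Hypothetical wrapper (not part of this repo): refuse to start training when
# CONFIG or the chosen training script is missing.
run_training() {
    script="$1"
    # Assumption: CONFIG lives in the current directory (the repo root).
    if [ ! -f CONFIG ]; then
        echo "error: CONFIG not found; edit it to point to your data first" >&2
        return 1
    fi
    if [ ! -x "$script" ]; then
        echo "error: training script missing or not executable: $script" >&2
        return 1
    fi
    # Run the script exactly as the instructions above do.
    "$script"
}

# Example usage from the repo root:
#   run_training ./scripts/train_cub_keypoints.sh
```

The guard only checks that the files exist; it does not validate the paths written inside CONFIG.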

####How to generate samples:

Run ./scripts/run_all_demos.sh. HTML files will be generated with results like the following:

Moving the bird's position via bounding box:

Moving the bird's position via keypoints:

Birds text to image with ground-truth keypoints:

Birds text to image with generated keypoints:

Humans text to image with ground-truth keypoints:

Humans text to image with generated keypoints:
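
The location of the generated HTML files is not stated here. One way to find them, sketched below, is to create a timestamp marker before running the demos and then list HTML files newer than that marker. The `list_new_html` helper and the `.demo_start` marker name are hypothetical and not part of this repository.

```shell
#!/bin/sh
# Hypothetical helper (not part of this repo): list HTML files under a
# directory that were modified after a marker file was last touched.
list_new_html() {
    marker="$1"
    dir="${2:-.}"
    find "$dir" -type f -name '*.html' -newer "$marker"
}

# Example usage from the repo root:
#   touch .demo_start
#   ./scripts/run_all_demos.sh
#   list_new_html .demo_start
```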

####Citation

If you find this useful, please cite our work as follows:

@inproceedings{reed2016learning,
  title={Learning What and Where to Draw},
  author={Scott Reed and Zeynep Akata and Santosh Mohan and Samuel Tenka and Bernt Schiele and Honglak Lee},
  booktitle={Advances in Neural Information Processing Systems},
  year={2016}
}
