Code for the paper: Sketch Your Own GAN

Last update: Dec 28, 2022

Related tags

Overview

Sketch Your Own GAN

Our method takes in one or a few hand-drawn sketches and customizes an off-the-shelf GAN to match the input sketch. While our new model changes an object’s shape and pose, other visual cues such as color, texture, background, are faithfully preserved after the modification.

Sheng-Yu Wang¹, David Bau², Jun-Yan Zhu¹.
CMU¹, MIT CSAIL²
In ICCV, 2021.

Training code, evaluation code, and datasets will be released soon.

Results

Our method can customize a pre-trained GAN to match input sketches.

Interpolation using our customized models. Latent space interpolation is smooth with our customized models.

Image 1

Interoplation

Image 2

Image editing using our customized models. Given a real image (a), we project it to the original model's latent space z using Huh et al. (b). (c) We then feed the projected z to the our standing cat model trained on sketches. (d) Finally, we showed edit the image with add fur operation using GANSpace.

Failure case. Our method is not capable of generating images to match the Attneave’s cat sketch or the horse sketch by Picasso. We note that Attneave’s cat depicts a complex pose, and Picasso’s sketches are drawn with a distinctive style, both of which make our method struggle.

Getting Started

Clone our repo

git clone [email protected]:PeterWang512/GANSketching.git
cd GANSketching

Install packages

Install PyTorch (version >= 1.6.0) (pytorch.org)
```
pip install -r requirements.txt
```

Download model weights

Run bash weights/download_weights.sh

Generate samples from a customized model

This command runs the customized model specified by ckpt, and generates samples to save_dir.

# generates samples from the "standing cat" model.
python generate.py --ckpt weights/photosketch_standing_cat_noaug.pth --save_dir output/samples_standing_cat

# generates samples from the cat face model in Figure. 1 of the paper.
python generate.py --ckpt weights/by_author_cat_aug.pth --save_dir output/samples_teaser_cat

Latent space edits by GANSpace

Our model preserves the latent space editability of the original model. Our models can apply the same edits using the latents reported in Härkönen et.al. (GANSpace).

# add fur to the standing cats
python ganspace.py --obj cat --comp_id 27 --scalar 50 --layers 2,4 --ckpt weights/photosketch_standing_cat_noaug.pth --save_dir output/ganspace_fur_standing_cat

# close the eyes of the standing cats
python ganspace.py --obj cat --comp_id 45 --scalar 60 --layers 5,7 --ckpt weights/photosketch_standing_cat_noaug.pth --save_dir output/ganspace_eye_standing_cat

Acknowledgments

This repository borrows partially from SPADE, stylegan2-pytorch, PhotoSketch, GANSpace, and data-efficient-gans.

Reference

If you find this useful for your research, please cite the following work.

@inproceedings{wang2021sketch,
  title={Sketch Your Own GAN},
  author={Wang, Sheng-Yu and Bau, David and Zhu, Jun-Yan},
  booktitle={Proceedings of the IEEE International Conference on Computer Vision},
  year={2021}
}

Feel free to contact us with any comments or feedback.

Code for the paper: Sketch Your Own GAN

Related tags

Overview

Sketch Your Own GAN

Results

Getting Started

Clone our repo

Install packages

Download model weights

Generate samples from a customized model

Latent space edits by GANSpace

Acknowledgments

Reference

Owner

Implementation and replication of ProGen, Language Modeling for Protein Generation, in Jax

The AugNet Python module contains functions for the fast computation of image similarity.

[CVPR 2020] Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation

Continuous Diffusion Graph Neural Network

A minimal implementation of face-detection models using flask, gunicorn, nginx, docker, and docker-compose

The code release of paper 'Domain Generalization for Medical Imaging Classification with Linear-Dependency Regularization' NIPS 2020.

Tensorflow Implementation of ECCV'18 paper: Multimodal Human Motion Synthesis

MagFace: A Universal Representation for Face Recognition and Quality Assessment

"3D Human Texture Estimation from a Single Image with Transformers", ICCV 2021

"Learning Free Gait Transition for Quadruped Robots vis Phase-Guided Controller"

StyleGAN-NADA: CLIP-Guided Domain Adaptation of Image Generators

yolov5 deepsort 行人车辆跟踪检测计数

This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in the Wild"

This repo in the implementation of EMNLP'21 paper "SPARQLing Database Queries from Intermediate Question Decompositions" by Irina Saparina, Anton Osokin

Pytorch implementation for "Implicit Semantic Response Alignment for Partial Domain Adaptation"

Source code of the paper "Deep Learning of Latent Variable Models for Industrial Process Monitoring".

Modeling Category-Selective Cortical Regions with Topographic Variational Autoencoders

Normalization Calibration (NorCal) for Long-Tailed Object Detection and Instance Segmentation

Video Matting via Consistency-Regularized Graph Neural Networks

《Fst Lerning of Temporl Action Proposl vi Dense Boundry Genertor》(AAAI 2020)

Code for the paper: Sketch Your Own GAN

Related tags

Overview

Sketch Your Own GAN

Results

Getting Started

Clone our repo

Install packages

Download model weights

Generate samples from a customized model

Latent space edits by GANSpace

Acknowledgments

Reference

Owner

Implementation and replication of ProGen, Language Modeling for Protein Generation, in Jax

The AugNet Python module contains functions for the fast computation of image similarity.

[CVPR 2020] Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation

Continuous Diffusion Graph Neural Network

A minimal implementation of face-detection models using flask, gunicorn, nginx, docker, and docker-compose

The code release of paper 'Domain Generalization for Medical Imaging Classification with Linear-Dependency Regularization' NIPS 2020.

Tensorflow Implementation of ECCV'18 paper: Multimodal Human Motion Synthesis

MagFace: A Universal Representation for Face Recognition and Quality Assessment

"3D Human Texture Estimation from a Single Image with Transformers", ICCV 2021

"Learning Free Gait Transition for Quadruped Robots vis Phase-Guided Controller"

StyleGAN-NADA: CLIP-Guided Domain Adaptation of Image Generators

yolov5 deepsort 行人 车辆 跟踪 检测 计数

This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in the Wild"

This repo in the implementation of EMNLP'21 paper "SPARQLing Database Queries from Intermediate Question Decompositions" by Irina Saparina, Anton Osokin

Pytorch implementation for "Implicit Semantic Response Alignment for Partial Domain Adaptation"

Source code of the paper "Deep Learning of Latent Variable Models for Industrial Process Monitoring".

Modeling Category-Selective Cortical Regions with Topographic Variational Autoencoders

Normalization Calibration (NorCal) for Long-Tailed Object Detection and Instance Segmentation

Video Matting via Consistency-Regularized Graph Neural Networks

《Fst Lerning of Temporl Action Proposl vi Dense Boundry Genertor》(AAAI 2020)

yolov5 deepsort 行人车辆跟踪检测计数