TextureGAN in PyTorch

Last update: Dec 14, 2022

TextureGAN

This code is our PyTorch implementation of TextureGAN [Project] [Arxiv]

TextureGAN is a generative adversarial network conditioned on sketch and colors/textures. Users “drag” one or more example textures onto sketched objects and the network realistically applies these textures to the indicated objects.

Setup

Prerequisites

Linux or OSX
Python 2.7
NVIDIA GPU + CUDA CuDNN

Dependencies

Visdom
IPython notebook
PyTorch 0.2 (torch and torchvision)
NumPy, scikit-image, matplotlib, etc.

Getting Started

Clone this repo

git clone git@github.com:janesjanes/texturegan.git
cd texturegan

Prepare the datasets. Download the training data:

wget https://s3-us-west-2.amazonaws.com/texturegan/training_handbag.tar.gz
tar -xvzf training_handbag.tar.gz

For shoe: https://s3-us-west-2.amazonaws.com/texturegan/training_shoe.tar.gz

For cloth: https://s3-us-west-2.amazonaws.com/texturegan/training_cloth.tar.gz

Train the model from scratch. See python main.py --help for training options. Example arguments (see the paper for the exact parameter values):

python main.py --display_port 7779 --gpu 3 --model texturegan --feature_weight 5e3 --pixel_weight_ab 1e4 \
  --global_pixel_weight_l 5e5 --local_pixel_weight_l 0 --style_weight 0 --discriminator_weight 5e5 --discriminator_local_weight 7e5 --learning_rate 5e-4 --learning_rate_D 1e-4 --batch_size 36 --save_every 100 --num_epoch 100000 --save_dir [./save_dir] \
  --data_path [training_handbags_pretrain/] --learning_rate_D_local 1e-4 --local_texture_size 50 --patch_size_min 20 \
  --patch_size_max 50 --num_input_texture_patch 1 --visualize_every 5 --num_local_texture_patch 5

Models will be saved to ./save_dir
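Because the training command takes many flags, it can be convenient to assemble it from a dictionary, for example when sweeping over loss weights. The following is a minimal sketch, not part of this repository: the helper name build_train_command is hypothetical, the flag names and values are copied from the example command above, and the save_dir and data_path values are placeholders to replace with your own paths. It only builds and prints the command; it does not run it.

```python
def build_train_command(script, options):
    """Turn a dict of option names and values into an argument list."""
    argv = ["python", script]
    for name, value in options.items():
        # Each dictionary entry becomes "--name value" on the command line.
        argv += ["--" + name, str(value)]
    return argv


# Values are kept as strings so that notation such as "5e3" is preserved.
options = {
    "display_port": "7779",
    "gpu": "3",
    "model": "texturegan",
    "feature_weight": "5e3",
    "pixel_weight_ab": "1e4",
    "global_pixel_weight_l": "5e5",
    "discriminator_weight": "5e5",
    "discriminator_local_weight": "7e5",
    "learning_rate": "5e-4",
    "learning_rate_D": "1e-4",
    "batch_size": "36",
    "save_dir": "./save_dir",                    # placeholder path
    "data_path": "training_handbags_pretrain/",  # placeholder path
}

argv = build_train_command("main.py", options)
print(" ".join(argv))
```

The resulting list can be passed to a process launcher of your choice, or the printed line can be pasted into a shell.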

See the Train section for more training details.

You can also load our pretrained models from the Download Models section.

To view results and losses as the model trains, start a visdom server on the port given by --display_port:

python -m visdom.server -port 7779

Test the model

See our Ipython Notebook Test_script.ipynb

Train

TextureGAN proposes a two-stage training scheme.

The first training stage is ground-truth pre-training. We extract the input edges and the texture patch from the same ground-truth image. Here, we show how to train the ground-truth pretrained model using a combination of pixel loss, color loss, feature loss, and adversarial loss.

python main.py --display_port 7779 --gpu 0 --model texturegan --feature_weight 10 --pixel_weight_ab 1e5 \
  --global_pixel_weight_l 100 --style_weight 0 --discriminator_weight 10 --learning_rate 1e-3 --learning_rate_D 1e-4 \
  --save_dir [/home/psangkloy3/handbag_texturedis_scratch] --data_path [./save_dir] --batch_size 16 --save_every 500 --num_epoch 100000 \
  --input_texture_patch original_image --loss_texture original_image --local_texture_size 50 --discriminator_local_weight 100 \
  --num_input_texture_patch 1
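The first-stage objective described above can be pictured as a weighted sum of the individual loss terms. The sketch below is only an illustration under stated assumptions: the function name combine_losses and the per-term loss values are made up, and the mapping of flags to terms (--feature_weight to the feature loss, --pixel_weight_ab to the color loss, --global_pixel_weight_l to the pixel loss, --discriminator_weight to the adversarial loss) is our reading of the flag names, not something confirmed by the training code.

```python
def combine_losses(losses, weights):
    """Return the weighted sum of the named loss terms."""
    missing = set(losses) - set(weights)
    if missing:
        raise KeyError("no weight given for: " + ", ".join(sorted(missing)))
    return sum(weights[name] * value for name, value in losses.items())


# Weights taken from the first-stage example command (assumed flag mapping).
stage1_weights = {
    "feature": 10.0,      # --feature_weight
    "color_ab": 1e5,      # --pixel_weight_ab
    "pixel_l": 100.0,     # --global_pixel_weight_l
    "adversarial": 10.0,  # --discriminator_weight
}

# Made-up per-term values standing in for one training batch.
example_losses = {
    "feature": 0.5,
    "color_ab": 0.0002,
    "pixel_l": 0.01,
    "adversarial": 0.7,
}

total = combine_losses(example_losses, stage1_weights)
print(total)
```

Raising on a missing weight, rather than silently treating it as zero, makes a mistyped term name fail early.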

The second stage is external texture fine-tuning. This step is important for the network to reproduce textures for which we have no ground-truth output (e.g. a handbag with snakeskin texture). This time, we extract texture patches from an external texture dataset (see the Download Datasets section). We keep the feature and adversarial losses unchanged, but modify the pixel and color losses to compare the generated result with the entire input texture from which the input texture patches are extracted. We fine-tune the previously pretrained model and add a local texture loss by training a separate texture discriminator.

python main.py --display_port 7779 --load 1500 --load_D 1500 --load_epoch 222 --gpu 0 --model texturegan --feature_weight 5e3 \
  --pixel_weight_ab 1e4 --global_pixel_weight_l 5e5 --local_pixel_weight_l 0 --style_weight 0 --discriminator_weight 5e5 \
  --discriminator_local_weight 7e5 --learning_rate 5e-4 --learning_rate_D 1e-4 --batch_size 36 --save_every 100 --num_epoch 100000 \
  --save_dir [skip_leather_handbag/] --load_dir [handbag_texturedis_scratch/] \
  --data_path [./save_dir] --learning_rate_D_local 1e-4 --local_texture_size 50 --patch_size_min 20 --patch_size_max 50 \
  --num_input_texture_patch 1 --visualize_every 5 --input_texture_patch dtd_texture --num_local_texture_patch 5
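To give a feel for what --patch_size_min and --patch_size_max control, here is a minimal sketch of cutting a random square patch out of a texture image. It is an assumption-laden illustration, not the repository's sampling code: the names sample_patch_box and crop are hypothetical, the image is a plain nested list rather than a tensor, and uniform sampling of size and position is our assumption.

```python
import random


def sample_patch_box(height, width, size_min=20, size_max=50, rng=random):
    """Pick a random square patch that lies fully inside a height x width image.

    Returns (top, left, size).
    """
    largest = min(size_max, height, width)
    if largest < size_min:
        raise ValueError("image is smaller than the minimum patch size")
    size = rng.randint(size_min, largest)  # inclusive on both ends
    top = rng.randint(0, height - size)
    left = rng.randint(0, width - size)
    return top, left, size


def crop(image, box):
    """Cut the patch described by box out of a row-major nested-list image."""
    top, left, size = box
    return [row[left:left + size] for row in image[top:top + size]]


# A dummy 128 x 128 single-channel "texture" filled with zeros.
texture = [[0] * 128 for _ in range(128)]
box = sample_patch_box(128, 128, size_min=20, size_max=50, rng=random.Random(0))
patch = crop(texture, box)
print(box, len(patch), len(patch[0]))
```

Passing a seeded random.Random instance as rng keeps the sampling reproducible, which helps when comparing runs.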

Download Datasets

The datasets we used to generate sketch and image pairs in this paper were collected by other researchers. Please cite their papers if you use the data. Each dataset is split into a training set and a test set.

Shoes dataset: ## training images from UT Zappos50K dataset.
Handbags dataset: ## Amazon Handbag images from iGAN project.
Deep Fashion Dataset: ## clothes images from Deep Fashion Dataset.

Edges are computed by HED edge detector + post-processing. [Citation]

The datasets we used for input texture patches are the DTD Dataset and a leather dataset we collected from the internet.

DTD Dataset:
Leather Dataset:

Download Models

Pre-trained models

For shoe model
For handbag model
For cloth model: https://s3-us-west-2.amazonaws.com/texturegan/final_cloth_finetune.pth

Citation

If you find this code useful for your research, please cite:

"TextureGAN: Controlling Deep Image Synthesis with Texture Patches"

Wenqi Xian, Patsorn Sangkloy, Varun Agrawal, Amit Raj, Jingwan Lu, Chen Fang, Fisher Yu, James Hays in CVPR, 2018.

@article{xian2017texturegan,
  title={Texturegan: Controlling deep image synthesis with texture patches},
  author={Xian, Wenqi and Sangkloy, Patsorn and Agrawal, Varun and Raj, Amit and Lu, Jingwan and Fang, Chen and Yu, Fisher and Hays, James},
  journal={arXiv preprint arXiv:1706.02823},
  year={2017}
}

Owner

Patsorn
