SAVI2I: Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors

Last update: Dec 30, 2022

[Paper] [Project Website]

PyTorch implementation of SAVI2I. We propose a simple yet effective signed attribute vector (SAV) that facilitates continuous translation on diverse mapping paths across multiple domains.
For more video results, please see our webpage.
Contact: Qi Mao ([email protected])

Paper

Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors
Qi Mao, Hsin-Ying Lee, Hung-Yu Tseng, Jia-Bin Huang, Siwei Ma, and Ming-Hsuan Yang
In arXiv 2020

Citation

If you find this work useful for your research, please cite our paper:

    @article{mao2020continuous,
      author       = "Mao, Qi and Lee, Hsin-Ying and Tseng, Hung-Yu and Huang, Jia-Bin and Ma, Siwei and Yang, Ming-Hsuan",
      title        = "Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors",
      journal    = "arXiv preprint 2011.01215",
      year         = "2020"
    }

Quick Start

Prerequisites

Linux or Windows
Python 3+
We suggest using two P100 16GB GPUs or one V100 32GB GPU.

Install

Clone this repo:

git clone https://github.com/HelenMao/SAVI2I.git
cd SAVI2I

This code requires PyTorch 0.4.0+ and Python 3+. Install the dependencies with:

conda create -n SAVI2I python=3.6
source activate SAVI2I
pip install -r requirements.txt

Training Datasets

Download the datasets for each task into the dataset folder:

./datasets

Style translation: Yosemite (summer <-> winter) and Photo2Artwork (Photo, Monet, Van Gogh and Ukiyo-e)

Follow the CycleGAN dataset instructions to download the Yosemite and Photo2artwork datasets.

Shape-variation translation: CelebA-HQ (Male <-> Female) and AFHQ (Cat, Dog and WildLife)

We split CelebA-HQ into male and female domains according to the annotated labels and fine-tune the images manually.

Follow the StarGAN-v2 dataset instructions to download the CelebA-HQ and AFHQ datasets.
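
Before training, it can help to confirm that each dataset landed where the training commands expect it. The following is a minimal sketch, not part of the repository: the helper name `missing_datasets` is hypothetical, and the folder names are assumed from the `--dataroot` arguments used in the training commands in this README. It only checks that the top-level folders exist and makes no assumption about their internal layout.

```python
from pathlib import Path

# Folder names assumed from the --dataroot arguments in this README.
EXPECTED_DATASETS = ("Yosemite", "Photo2artwork", "CelebAHQ", "AFHQ")


def missing_datasets(root="./datasets"):
    """Return the expected dataset folders that are not present under root."""
    root_path = Path(root)
    return [name for name in EXPECTED_DATASETS if not (root_path / name).is_dir()]


if __name__ == "__main__":
    missing = missing_datasets()
    if missing:
        print("Missing dataset folders:", ", ".join(missing))
    else:
        print("All expected dataset folders are present.")
```

If you only plan to train on one task, the other folders can be ignored.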

Training

Notes

For low-level style translation tasks, we suggest setting --type=1 to use the corresponding network architecture.
For shape-variation translation tasks, we suggest setting --type=0 to use the corresponding network architecture.

Yosemite

python train.py --dataroot ./datasets/Yosemite/ --phase train --type 1 --name Yosemite --n_ep 700 --n_ep_decay 500 --lambda_r1 10 --lambda_mmd 1 --num_domains 2

Photo2artwork

python train.py --dataroot ./datasets/Photo2artwork/ --phase train --type 1 --name Photo2artwork --n_ep 100 --n_ep_decay 0 --lambda_r1 10 --lambda_mmd 1 --num_domains 4

CelebAHQ

python train.py --dataroot ./datasets/CelebAHQ/ --phase train --type 0 --name CelebAHQ --n_ep 30 --n_ep_decay 0 --lambda_r1 1 --lambda_mmd 1 --num_domains 2

AFHQ

python train.py --dataroot ./datasets/AFHQ/ --phase train --type 0 --name AFHQ --n_ep 100 --n_ep_decay 0 --lambda_r1 1 --lambda_mmd 10 --num_domains 3
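
The four commands above differ only in a handful of values, so they can be kept in one table and generated on demand. This is a minimal sketch under that assumption: `TRAIN_CONFIGS` and `build_train_command` are hypothetical names that are not part of the repository, and the values are copied verbatim from the commands listed above.

```python
import shlex

# Per-dataset settings copied from the training commands in this README.
TRAIN_CONFIGS = {
    "Yosemite": {"type": 1, "n_ep": 700, "n_ep_decay": 500,
                 "lambda_r1": 10, "lambda_mmd": 1, "num_domains": 2},
    "Photo2artwork": {"type": 1, "n_ep": 100, "n_ep_decay": 0,
                      "lambda_r1": 10, "lambda_mmd": 1, "num_domains": 4},
    "CelebAHQ": {"type": 0, "n_ep": 30, "n_ep_decay": 0,
                 "lambda_r1": 1, "lambda_mmd": 1, "num_domains": 2},
    "AFHQ": {"type": 0, "n_ep": 100, "n_ep_decay": 0,
             "lambda_r1": 1, "lambda_mmd": 10, "num_domains": 3},
}


def build_train_command(name, dataset_root="./datasets"):
    """Return the train.py argument list for one of the datasets above."""
    cfg = TRAIN_CONFIGS[name]
    return [
        "python", "train.py",
        "--dataroot", f"{dataset_root}/{name}/",
        "--phase", "train",
        "--type", str(cfg["type"]),
        "--name", name,
        "--n_ep", str(cfg["n_ep"]),
        "--n_ep_decay", str(cfg["n_ep_decay"]),
        "--lambda_r1", str(cfg["lambda_r1"]),
        "--lambda_mmd", str(cfg["lambda_mmd"]),
        "--num_domains", str(cfg["num_domains"]),
    ]


if __name__ == "__main__":
    # Print each command so it can be reviewed or pasted into a shell.
    for dataset_name in TRAIN_CONFIGS:
        print(shlex.join(build_train_command(dataset_name)))
```

The returned list can also be passed directly to `subprocess.run` from the repository root. Note that the two style tasks use `--type 1` and the two shape-variation tasks use `--type 0`, matching the notes above.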

Pre-trained Models

Download and save them into

./models

Alternatively, download the pre-trained models with the following script:

bash ./download_models.sh

Testing

Reference-guided

python test_reference_save.py --dataroot ./datasets/CelebAHQ --resume ./models/CelebAHQ/00029.pth --phase test --type 0 --num_domains 2 --index_s A --index_t B --num 5 --name CelebAHQ_ref

Latent-guided

python test_latent_rdm_save.py --dataroot ./datasets/CelebAHQ --resume ./models/CelebAHQ/00029.pth --phase test --type 0 --num_domains 2 --index_s A --index_t B --num 5 --name CelebAHQ_rdm
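
The reference-guided and latent-guided commands above share every argument except the script name and the run name. A minimal sketch that makes this explicit is shown below. `build_test_command` and its `mode` keys are hypothetical and not part of the repository; the script names, flags, and default values are copied from the two commands above.

```python
# Script names taken from the two test commands in this README.
TEST_SCRIPTS = {
    "ref": "test_reference_save.py",   # reference-guided translation
    "rdm": "test_latent_rdm_save.py",  # latent-guided translation
}


def build_test_command(mode, dataset="CelebAHQ", checkpoint="00029.pth",
                       source="A", target="B", num=5,
                       model_type=0, num_domains=2):
    """Return the argument list for a reference- or latent-guided test run."""
    return [
        "python", TEST_SCRIPTS[mode],
        "--dataroot", f"./datasets/{dataset}",
        "--resume", f"./models/{dataset}/{checkpoint}",
        "--phase", "test",
        "--type", str(model_type),
        "--num_domains", str(num_domains),
        "--index_s", source,
        "--index_t", target,
        "--num", str(num),
        "--name", f"{dataset}_{mode}",
    ]


if __name__ == "__main__":
    for test_mode in TEST_SCRIPTS:
        print(" ".join(build_test_command(test_mode)))
```

When testing another dataset, the `model_type` and `num_domains` values should presumably match those used for training that dataset.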

License

All rights reserved.
Licensed under the CC BY-NC-SA 4.0 (Attribution-NonCommercial-ShareAlike 4.0 International).
The code is for academic research use only. For commercial use, please contact [email protected].

Acknowledgements

Code and network architectures are inspired by:

Owner

Qi Mao
