MDMM - Learning multi-domain multi-modality I2I translation

Last update: Nov 04, 2022

Related tags

Overview

Multi-Domain Multi-Modality I2I translation

Pytorch implementation of multi-modality I2I translation for multi-domains. The project is an extension to the "Diverse Image-to-Image Translation via Disentangled Representations(https://arxiv.org/abs/1808.00948)", ECCV 2018. With the disentangled representation framework, we can learn diverse image-to-image translation among multiple domains. [DRIT]

Contact: Hsin-Ying Lee ([email protected]) and Hung-Yu Tseng ([email protected])

Example Results

Prerequisites

Python 3.5 or Python 3.6
Pytorch 0.4.0 and torchvision (https://pytorch.org/)
TensorboardX
Tensorflow (for tensorboard usage)
Docker file based on CUDA 9.0, CuDNN 7.1, and Ubuntu 16.04 is provided in the [DRIT] github page.

Usage

Training

python train.py --dataroot DATAROOT --name NAME --num_domains NUM_DOMAINS --display_dir DISPLAY_DIR --result_dir RESULT_DIR --isDcontent

Testing

python test.py --dataroot DATAROOT --name NAME --num_domains NUM_DOMAINS --out_dir OUT_DIR --resume MODEL_DIR --num NUM_PER_IMG

Datasets

We validate our model on two datasets:

art: Containing three domains: real images, Monet images, uki-yoe images. Data can be downloaded from CycleGAN website.
weather: Containing four domains: sunny, cloudy, snowy, and foggy. Data is randomly selected from the Image2Weather dataset website.

The different domains in a dataset should be placed in folders "trainA, trainB, ..." in the alphabetical order.

Models

The pretrained model on the art dataset

bash ./models/download_model.sh art

The pretrained model on the weather dataset

bash ./models/download_model.sh weather

Note

The feature transformation (i.e. concat 0) is not fully tested since both art and weather datasets do not require shape variations
The hyper-parameters matter and are task-dependent. They are not carefully selected yet.
Feel free to contact the author for any potential improvement of the code.

Paper

Diverse Image-to-Image Translation via Disentangled Representations
Hsin-Ying Lee*, Hung-Yu Tseng*, Jia-Bin Huang, Maneesh Kumar Singh, and Ming-Hsuan Yang
European Conference on Computer Vision (ECCV), 2018 (oral) (* equal contribution)

Please cite our paper if you find the code or dataset useful for your research.

@inproceedings{DRIT,
  author = {Lee, Hsin-Ying and Tseng, Hung-Yu and Huang, Jia-Bin and Singh, Maneesh Kumar and Yang, Ming-Hsuan},
  booktitle = {European Conference on Computer Vision},
  title = {Diverse Image-to-Image Translation via Disentangled Representations},
  year = {2018}
}

MDMM - Learning multi-domain multi-modality I2I translation

Related tags

Overview

Multi-Domain Multi-Modality I2I translation

Example Results

Prerequisites

Usage

Datasets

Models

Note

Paper

Owner

Hsin-Ying Lee

Simple PyTorch implementations of Badnets on MNIST and CIFAR10.

Benchmark datasets, data loaders, and evaluators for graph machine learning

ICLR 2021: Pre-Training for Context Representation in Conversational Semantic Parsing

PyTorch implementation of NeurIPS 2021 paper: "CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration"

Tools for computational pathology

An Open-Source Tool for Automatic Disease Diagnosis..

TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.

Let's Git - Versionsverwaltung & Open Source Hausaufgabe

Imaging, analysis, and simulation software for radio interferometry

ZeroVL - The official implementation of ZeroVL

Framework for evaluating ANNS algorithms on billion scale datasets.

[ICSE2020] MemLock: Memory Usage Guided Fuzzing

PyTorch implementation of hand mesh reconstruction described in CMR and MobRecon.

A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer from NNAISENSE.

Bottleneck Transformers for Visual Recognition

An Empirical Investigation of Model-to-Model Distribution Shifts in Trained Convolutional Filters

Python Tensorflow 2 scripts for detecting objects of any class in an image without knowing their label.

Automatic learning-rate scheduler

Real-time ground filtering algorithm of cloud points acquired using Terrestrial Laser Scanner (TLS)

Fedlearn支持前沿算法研发的Python工具库 | Fedlearn algorithm toolkit for researchers