CVPR 2021: "The Spatially-Correlative Loss for Various Image Translation Tasks"

Last update: Jan 04, 2023

Related tags

Overview

Spatially-Correlative Loss

We provide the Pytorch implementation of "The Spatially-Correlative Loss for Various Image Translation Tasks". Based on the inherent self-similarity of object, we propose a new structure-preserving loss for one-sided unsupervised I2I network. The new loss will deal only with spatial relationship of repeated signal, regardless of their original absolute value.

The Spatially-Correlative Loss for Various Image Translation Tasks
Chuanxia Zheng, Tat-Jen Cham, Jianfei Cai
NTU and Monash University
In CVPR2021

ToDo

release the single-modal I2I model
a simple example to use the proposed loss

Example Results

Unpaired Image-to-Image Translation

Single Image Translation

More results on project page

Getting Started

Installation

This code was tested with Pytorch 1.7.0, CUDA 10.2, and Python 3.7

Install Pytoch 1.7.0, torchvision, and other dependencies from http://pytorch.org
Install python libraries visdom and dominate for visualization

pip install visdom dominate

Clone this repo:

git clone https://github.com/lyndonzheng/F-LSeSim
cd F-LSeSim

Datasets

Please refer to the original CUT and CycleGAN to download datasets and learn how to create your own datasets.

Training

Train the single-modal I2I translation model:

sh ./scripts/train_sc.sh

Set --use_norm for cosine similarity map, the default similarity is dot-based attention score. --learned_attn, --augment for the learned self-similarity.
To view training results and loss plots, run python -m visdom.server and copy the URL http://localhost:port.
Training models will be saved under the checkpoints folder.
The more training options can be found in the options folder.
Train the single-image translation model:

sh ./scripts/train_sinsc.sh

As the multi-modal I2I translation model was trained on MUNIT, we would not plan to merge the code to this repository. If you wish to obtain multi-modal results, please contact us at [email protected].

Testing

Test the single-modal I2I translation model:

sh ./scripts/test_sc.sh

Test the single-image translation model:

sh ./scripts/test_sinsc.sh

Test the FID score for all training epochs:

sh ./scripts/test_fid.sh

Pretrained Models

Download the pre-trained models (will be released soon) using the following links and put them undercheckpoints/ directory.

Single-image translation model: image2monet

Citation

@inproceedings{zheng2021spatiallycorrelative,
  title={The Spatially-Correlative Loss for Various Image Translation Tasks},
  author={Zheng, Chuanxia and Cham, Tat-Jen and Cai, Jianfei},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  year={2021}
}

Acknowledge

Our code is developed based on CUT and CycleGAN. We also thank pytorch-fid for FID computation, LPIPS for diversity score, and D&C for density and coverage evaluation.

CVPR 2021: "The Spatially-Correlative Loss for Various Image Translation Tasks"

Related tags

Overview

Spatially-Correlative Loss

ToDo

Example Results

Unpaired Image-to-Image Translation

Single Image Translation

More results on project page

Getting Started

Installation

Datasets

Training

Testing

Pretrained Models

Citation

Acknowledge

Owner

Chuanxia Zheng

Unofficial Implementation of MLP-Mixer, gMLP, resMLP, Vision Permutator, S2MLPv2, RaftMLP, ConvMLP, ConvMixer in Jittor and PyTorch.

Official code repository for the EMNLP 2021 paper

ViDT: An Efficient and Effective Fully Transformer-based Object Detector

A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis

Stratified Transformer for 3D Point Cloud Segmentation (CVPR 2022)

Official implementation for “Unsupervised Low-Light Image Enhancement via Histogram Equalization Prior”

Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis (CVPR2022)

Python package to generate image embeddings with CLIP without PyTorch/TensorFlow

Research into Forex price prediction from price history using Deep Sequence Modeling with Stacked LSTMs.

Deployment of PyTorch chatbot with Flask

Extract MNIST handwritten digits dataset binary file into bmp images

[ICML'21] Estimate the accuracy of the classifier in various environments through self-supervision

The openspoor package is intended to allow easy transformation between different geographical and topological systems commonly used in Dutch Railway

Code release for Hu et al. Segmentation from Natural Language Expressions. in ECCV, 2016

clDice - a Novel Topology-Preserving Loss Function for Tubular Structure Segmentation

GAN Image Generator and Characterwise Image Recognizer with python

Pytorch implemenation of Stochastic Multi-Label Image-to-image Translation (SMIT)

Code repo for "FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation" (ICCV 2021)

Background Matting: The World is Your Green Screen

Bayesian Generative Adversarial Networks in Tensorflow