Official implementation for "QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation" (CVPR 2022)

Last update: Dec 16, 2022

Overview

QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation (CVPR2022)

Unpaired image-to-image (I2I) translation often requires to maximize the mutual information between the source and the translated images across different domains, which is critical for the generator to keep the source content and prevent it from unnecessary modifications. The self-supervised contrastive learning has already been successfully applied in the I2I. By constraining features from the same location to be closer than those from different ones, it implicitly ensures the result to take content from the source. However, previous work uses the features from random locations to impose the constraint, which may not be appropriate since some locations contain less information of source domain. Moreover, the feature itself does not reflect the relation with others. This paper deals with these problems by intentionally selecting significant anchor points for contrastive learning. We design a query-selected attention (QS-Attn) module, which compares feature distances in the source domain, giving an attention matrix with a probability distribution in each row. Then we select queries according to their measurement of significance, computed from the distribution. The selected ones are regarded as anchors for contrastive loss. At the same time, the reduced attention matrix is employed to route features in both domains, so that source relations maintain in the synthesis. We validate our proposed method in three different I2I datasets, showing that it increases the image quality without adding learnable parameters.

QS-Attn applies attention to select anchors for contrastive learning in single-direction I2I task

Getting Started

Prerequisites

Ubuntu 16.04
NVIDIA GPU + CUDA CuDNN
Python 3 Please use pip install -r requirements.txt to install the dependencies.

Pretrained Models

We provide Global, Local and Global+Local models for three datasets.

Model	Cityscapes	Horse2zebra	AFHQ
Global	Cityscapes_Global	Horse2zebra_Global	AFHQ_Global
Local	Cityscapes_Local	Horse2zebra_Local	AFHQ_Local
Global+Local	Cityscapes_Global+Local	Horse2zebra_Global+Local	AFHQ_Global+Local

Training

Download horse2zebra dataset :

bash ./datasets/download_qsattn_dataset.sh horse2zebra

Train the global model:

python train.py \
--dataroot=datasets/horse2zebra \
--name=horse2zebra_global \
--QS_mode=global

You can use visdom to view the training loss: Run python -m visdom.server and click the URL http://localhost:8097.

Inference

Test the global model:

python test.py \
--dataroot=datasets/horse2zebra \
--name=horse2zebra_qsattn_global \
--QS_mode=global

Citation

If you use this code for your research, please cite

@article{hu2022qs,
  title={QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation},
  author={Hu, Xueqi and Zhou, Xinyue and Huang, Qiusheng and Shi, Zhengyi and Sun, Li and Li, Qingli},
  journal={arXiv preprint arXiv:2203.08483},
  year={2022}
}

Official implementation for "QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation" (CVPR 2022)

Related tags

Overview

QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation (CVPR2022)

Getting Started

Prerequisites

Pretrained Models

Training

Inference

Citation

Owner

Xueqi Hu

Source code and Dataset creation for the paper "Neural Symbolic Regression That Scales"

The repository contains reproducible PyTorch source code of our paper Generative Modeling with Optimal Transport Maps, ICLR 2022.

Interactive Visualization to empower domain experts to align ML model behaviors with their knowledge.

[CVPR'21] Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-view Transformation

Self-Supervised Image Denoising via Iterative Data Refinement

A modern pure-Python library for reading PDF files

This app is a simple example of using Strealit to create a financial data web app.

A lightweight Python-based 3D network multi-agent simulator. Uses a cell-based congestion model. Calculates risk, loudness and battery capacities of the agents. Suitable for 3D network optimization tasks.

Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming soon!

Voxel-based Network for Shape Completion by Leveraging Edge Generation (ICCV 2021, oral)

Discovering and Achieving Goals via World Models

Repository for the NeurIPS 2021 paper: "Exploiting Domain-Specific Features to Enhance Domain Generalization".

Safe Model-Based Reinforcement Learning using Robust Control Barrier Functions

Source code of our work: "Benchmarking Deep Models for Salient Object Detection"

A novel Engagement Detection with Multi-Task Training (ED-MTT) system

Python Interview Questions

ANN model for prediction a spatio-temporal distribution of supercooled liquid in mixed-phase clouds using Doppler cloud radar spectra.

A two-stage U-Net for high-fidelity denoising of historical recordings

UniLM AI - Large-scale Self-supervised Pre-training across Tasks, Languages, and Modalities

UpChecker is a simple opensource project to host it fast on your server and check is server up, view statistic, get messages if it is down. UpChecker - just run file and use project easy