Pytorch Implementation of the paper "Cross-domain Correspondence Learning for Exemplar-based Image Translation"

Last update: Sep 22, 2021

Related tags

Overview

CoCosNet

Pytorch Implementation of the paper "Cross-domain Correspondence Learning for Exemplar-based Image Translation" (CVPR 2020 oral).

Update:

20200525: Training code for deepfashion complete. Due to the memory limitations, I employed the following conversions:

Disable the non-local layer, as the memory cost is infeasible on common hardware. If the original paper is telling the truth that the non-lacal layer works on (128-128-256) tensors, then each attention matrix would contain 128^4 elements (which takes 1GB).
Shrink the correspondence map size from 64 to 32, leading to 4x memory save on dense correspondence matrices.
Shrink the base number of filters from 64 to 16.

The truncated model barely fits in a 12GB GTX Titan X card, but the performance would not be the same.

Environment

Ubuntu/CentOS
Pytorch 1.0+
opencv-python
tqdm

TODO list

Dataset Preparation

DeepFashion

Just follow the routine in the PATN repo

Pretrained Model

The pretrained model for human pose transfer task: TO BE RELEASED

Training

run python train.py.

Citations

If you find this repo useful for your research, don't forget to cite the original paper:

@article{Zhang2020CrossdomainCL,
  title={Cross-domain Correspondence Learning for Exemplar-based Image Translation},
  author={Pan Zhang and Bo Zhang and Dong Chen and Lu Yuan and Fang Wen},
  journal={ArXiv},
  year={2020},
  volume={abs/2004.05571}
}

Acknowledgement

TODO.

Pytorch Implementation of the paper "Cross-domain Correspondence Learning for Exemplar-based Image Translation"

Related tags

Overview

CoCosNet

Update:

Environment

TODO list

Dataset Preparation

DeepFashion

Pretrained Model

Training

Citations

Acknowledgement

Owner

Lingbo Yang

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

MASA-SR: Matching Acceleration and Spatial Adaptation for Reference-Based Image Super-Resolution (CVPR2021)

ScaleNet: A Shallow Architecture for Scale Estimation

Source code for TACL paper "KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation".

Earth Vision Foundation

Code to reproduce the results for Compositional Attention

This is a package for LiDARTag, described in paper: LiDARTag: A Real-Time Fiducial Tag System for Point Clouds

Uncertainty Estimation via Response Scaling for Pseudo-mask Noise Mitigation in Weakly-supervised Semantic Segmentation

Code for ICCV 2021 paper Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes using Scene Graphs

Multi-Horizon-Forecasting-for-Limit-Order-Books

TraND: Transferable Neighborhood Discovery for Unsupervised Cross-domain Gait Recognition.

Fine-Tune EleutherAI GPT-Neo to Generate Netflix Movie Descriptions in Only 47 Lines of Code Using Hugginface And DeepSpeed

KE-Dialogue: Injecting knowledge graph into a fully end-to-end dialogue system.

Neural-Pull: Learning Signed Distance Functions from Point Clouds by Learning to Pull Space onto Surfaces(ICML 2021)

The PyTorch implementation for paper "Neural Texture Extraction and Distribution for Controllable Person Image Synthesis" (CVPR2022 Oral)

Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

Multi-task Multi-agent Soft Actor Critic for SMAC

ViViT: Curvature access through the generalized Gauss-Newton's low-rank structure

Materials for my scikit-learn tutorial

Lowest memory consumption and second shortest runtime in NTIRE 2022 challenge on Efficient Super-Resolution