PyTorch implementation of the paper Improving Text-to-Image Synthesis Using Contrastive Learning

Last update: Dec 31, 2022

T2I_CL

This is the official PyTorch implementation of the paper "Improving Text-to-Image Synthesis Using Contrastive Learning".

Requirements

Linux
Python ≥ 3.6
PyTorch ≥ 1.4.0
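
The version floors above can be checked before starting a long training run. The snippet below is a minimal sketch and not part of the repository: `parse_version` and `check_environment` are hypothetical helper names, and the only PyTorch attribute it relies on is `torch.__version__`.

```python
import platform
import re
import sys


def parse_version(text):
    """Turn a version string such as '1.13.1+cu117' into (1, 13, 1)."""
    core = text.split("+")[0]  # drop a local build suffix like '+cu117'
    numbers = []
    for piece in core.split(".")[:3]:
        match = re.match(r"\d+", piece)
        numbers.append(int(match.group()) if match else 0)
    while len(numbers) < 3:  # pad '1.4' to (1, 4, 0) so comparisons behave
        numbers.append(0)
    return tuple(numbers)


def check_environment():
    """Return a list of problems found; an empty list means every check passed."""
    problems = []
    if platform.system() != "Linux":
        problems.append("expected Linux, found " + platform.system())
    if sys.version_info < (3, 6):
        problems.append("Python >= 3.6 is required")
    try:
        import torch  # imported lazily so the check also runs without PyTorch
    except ImportError:
        problems.append("PyTorch is not installed")
    else:
        if parse_version(torch.__version__) < (1, 4, 0):
            problems.append("PyTorch >= 1.4.0 is required, found " + torch.__version__)
    return problems
```

Calling `check_environment()` and printing the returned list is enough to see which requirement, if any, is not met.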

Prepare Data

Download the preprocessed datasets from AttnGAN

Alternatively, download them from DM-GAN

Training

Pretrain DAMSM+CL:
- For bird dataset: python pretrain_DAMSM.py --cfg cfg/DAMSM/bird.yml --gpu 0
- For coco dataset: python pretrain_DAMSM.py --cfg cfg/DAMSM/coco.yml --gpu 0
Train AttnGAN+CL:
- For bird dataset: python main.py --cfg cfg/bird_attn2.yml --gpu 0
- For coco dataset: python main.py --cfg cfg/coco_attn2.yml --gpu 0
Train DM-GAN+CL:
- For bird dataset: python main.py --cfg cfg/bird_DMGAN.yml --gpu 0
- For coco dataset: python main.py --cfg cfg/coco_DMGAN.yml --gpu 0
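
The commands above can be chained from a small driver script. The following is a sketch under assumptions, not a script shipped with the repository: the stage keys (`"damsm"`, `"attngan"`, `"dmgan"`) and the function names are hypothetical, and it assumes the DAMSM+CL pretraining step is run before the GAN training step. The GAN config file may need to be pointed at the freshly pretrained encoder by hand, which this sketch does not do.

```python
import subprocess

# (stage, dataset) -> (script, config), copied from the commands listed above.
CONFIGS = {
    ("damsm", "bird"): ("pretrain_DAMSM.py", "cfg/DAMSM/bird.yml"),
    ("damsm", "coco"): ("pretrain_DAMSM.py", "cfg/DAMSM/coco.yml"),
    ("attngan", "bird"): ("main.py", "cfg/bird_attn2.yml"),
    ("attngan", "coco"): ("main.py", "cfg/coco_attn2.yml"),
    ("dmgan", "bird"): ("main.py", "cfg/bird_DMGAN.yml"),
    ("dmgan", "coco"): ("main.py", "cfg/coco_DMGAN.yml"),
}


def build_command(stage, dataset, gpu=0):
    """Return the argument list for one training stage."""
    script, cfg = CONFIGS[(stage, dataset)]
    return ["python", script, "--cfg", cfg, "--gpu", str(gpu)]


def run_pipeline(model, dataset, gpu=0, dry_run=True):
    """Pretrain DAMSM+CL, then train the chosen GAN; only print when dry_run is set."""
    commands = [
        build_command("damsm", dataset, gpu),
        build_command(model, dataset, gpu),
    ]
    for command in commands:
        print(" ".join(command))
        if not dry_run:
            subprocess.run(command, check=True)  # stop at the first failing stage
    return commands
```

For example, `run_pipeline("dmgan", "bird")` prints the two bird commands without running them; pass `dry_run=False` to execute them in order.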

Pretrained Models

DAMSM+CL for bird. Download and save it to DAMSMencoders/
DAMSM+CL for coco. Download and save it to DAMSMencoders/
AttnGAN+CL for bird. Download and save it to models/
AttnGAN+CL for coco. Download and save it to models/
DM-GAN+CL for bird. Download and save it to models/
DM-GAN+CL for coco. Download and save it to models/
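
Before running evaluation, it can help to confirm that the downloads landed in the two directories named above. This is a minimal sketch: the checkpoint file names are not given here, so the hypothetical helper `missing_checkpoint_dirs` only checks that each directory exists and contains at least one file.

```python
from pathlib import Path

# Directory names taken from the download instructions above.
EXPECTED_DIRS = ("DAMSMencoders", "models")


def missing_checkpoint_dirs(root="."):
    """Return the expected directories that are absent or contain no files."""
    missing = []
    for name in EXPECTED_DIRS:
        folder = Path(root) / name
        has_files = folder.is_dir() and any(p.is_file() for p in folder.rglob("*"))
        if not has_files:
            missing.append(name)
    return missing
```

An empty return value means both directories hold at least one file; it does not verify that the files are the right checkpoints.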

Evaluation

Sample images and compute the R-precision:
- python main.py --cfg cfg/eval_bird.yml --gpu 0
- python main.py --cfg cfg/eval_coco.yml --gpu 0
Inception score:
- python inception_score_bird.py --image_folder fake_images_bird
- python inception_score_coco.py fake_images_coco
FID:
- python fid_score.py --gpu 0 --batch-size 50 --path1 real_images_bird --path2 fake_images_bird
- python fid_score.py --gpu 0 --batch-size 50 --path1 real_images_coco --path2 fake_images_coco
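
For readers unfamiliar with the first two metrics, the arithmetic behind them can be sketched in plain Python. This is an illustration under assumptions, not the code in the scripts above: the real scripts obtain similarities and class probabilities from trained networks, whereas the hypothetical functions `r_precision` and `inception_score` below take those numbers as ready-made lists. R-precision is shown with R = 1, the convention commonly used following AttnGAN, where each generated image is scored against its true caption and a set of mismatched captions.

```python
import math


def r_precision(similarities, true_index=0):
    """Fraction of images whose ground-truth caption has the highest similarity.

    Each row holds the similarities between one generated image and its
    candidate captions; `true_index` is the column of the ground-truth caption.
    """
    hits = 0
    for row in similarities:
        best = max(range(len(row)), key=row.__getitem__)
        hits += int(best == true_index)
    return hits / len(similarities)


def inception_score(class_probs):
    """exp of the mean KL divergence between p(y|x) and the marginal p(y)."""
    n = len(class_probs)
    num_classes = len(class_probs[0])
    marginal = [sum(p[k] for p in class_probs) / n for k in range(num_classes)]
    total_kl = 0.0
    for p in class_probs:
        # Terms with zero probability contribute nothing to the KL sum.
        total_kl += sum(pk * math.log(pk / marginal[k]) for k, pk in enumerate(p) if pk > 0)
    return math.exp(total_kl / n)
```

With two images that are each confidently assigned to a different class, `inception_score([[1.0, 0.0], [0.0, 1.0]])` evaluates to 2.0, the maximum for two classes; identical predictions for every image give 1.0.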

Citation

If you find this work useful in your research, please consider citing:

@article{ye2021improving,
  title={Improving Text-to-Image Synthesis Using Contrastive Learning},
  author={Ye, Hui and Yang, Xiulong and Takac, Martin and Sunderraman, Rajshekhar and Ji, Shihao},
  journal={arXiv preprint arXiv:2107.02423},
  year={2021}
}

Acknowledgements

Our work builds on AttnGAN and DM-GAN.
