Official implementation of Self-supervised Image-to-text and Text-to-image Synthesis

Last update: Jul 31, 2022

Related tags

Overview

Self-supervised Image-to-text and Text-to-image Synthesis

This is the official implementation of Self-supervised Image-to-text and Text-to-image Synthesis. The architecture of and are shown.

Dataset

We use Caltech-UCSD Birds-200-2011 and Oxford-102 datasets in this work.

Download Flower images
Rename the jpg folder to images and unzip 102flowers.zip and put it inside 102flowers folder
put 102flowers folder inside data folder
Download Birds data and put inside Data/
Download image data Extract them to Data/birds/

Dependencies

pytorch
torchvision
tensorboardX
pickle

Training

Training the image autoencoder

The driver program for training the image autoencoder is main.py

To train the image autoencoder on flower dataset

python main.py --cfg cfg/flowers_3stages.yml --gpu 0

To train the image autoencoder birds dataset

python main.py --cfg cfg/birds_3stages.yml --gpu 0

Models will automatically saved after a fixed number of iteration, to restart from a failed step edit netG_version in respective .yml file

Training the text autoencoder

python run_text_test.py dataset_type Input_Folder output_file.txt

For Flower Dataset dataset_type=1, for Birds Dataset dataset_type=2 e.g.

python run_text_test.py 2 /home/user/dev/unsup/data_datasets/CUB_200_2011 outbirds_n.txt

Training the mapping networks

Train the GAN-based mapping network

python MappingImageText.py Dataset_folder

e.g.

python MappingImageText.py /home/user/dev/unsup/data_datasets/CUB_200_2011

Train the MMD-based mapping network

python mmd_ganTI.py --dataset /home/das/dev/data_datasets/birds_dataset/CUB_200_2011 --gpu_device 0

python mmd_ganIT.py --dataset /home/das/dev/data_datasets/birds_dataset/CUB_200_2011 --gpu_device 0

Official implementation of Self-supervised Image-to-text and Text-to-image Synthesis

Related tags

Overview

Self-supervised Image-to-text and Text-to-image Synthesis

Dataset

Dependencies

Training

Training the image autoencoder

To train the image autoencoder on flower dataset

To train the image autoencoder birds dataset

Training the text autoencoder

Training the mapping networks

Train the GAN-based mapping network

Train the MMD-based mapping network

Owner

天勤量化开发包, 期货量化, 实时行情/历史数据/实盘交易

Code for the paper "JANUS: Parallel Tempered Genetic Algorithm Guided by Deep Neural Networks for Inverse Molecular Design"

A library for researching neural networks compression and acceleration methods.

Code for the Convolutional Vision Transformer (ConViT)

Keyword2Text This repository contains the code of the paper: "A Plug-and-Play Method for Controlled Text Generation"

Lightweight stereo matching network based on MobileNetV1 and MobileNetV2

Download and preprocess popular sequential recommendation datasets

Self-supervised learning algorithms provide a way to train Deep Neural Networks in an unsupervised way using contrastive losses

ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution. The training codes are in BasicSR.

Compute FID scores with PyTorch.

Source code for CIKM 2021 paper for Relation-aware Heterogeneous Graph for User Profiling

A Python package for time series augmentation

The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGIR2022

Creating a Linear Program Solver by Implementing the Simplex Method in Python with NumPy

Adversarial examples to the new ConvNeXt architecture

An implementation for the loss function proposed in Decoupled Contrastive Loss paper.

3D position tracking for soccer players with multi-camera videos

This repo is to present various code demos on how to use our Graph4NLP library.

Demo code for paper "Learning optical flow from still images", CVPR 2021.

Code for the paper "Curriculum Dropout", ICCV 2017