Official implementation of Self-supervised Image-to-text and Text-to-image Synthesis

Last update: Jul 31, 2022

Related tags

Overview

Self-supervised Image-to-text and Text-to-image Synthesis

This is the official implementation of Self-supervised Image-to-text and Text-to-image Synthesis. The architecture of and are shown.

Dataset

We use Caltech-UCSD Birds-200-2011 and Oxford-102 datasets in this work.

Download Flower images
Rename the jpg folder to images and unzip 102flowers.zip and put it inside 102flowers folder
put 102flowers folder inside data folder
Download Birds data and put inside Data/
Download image data Extract them to Data/birds/

Dependencies

pytorch
torchvision
tensorboardX
pickle

Training

Training the image autoencoder

The driver program for training the image autoencoder is main.py

To train the image autoencoder on flower dataset

python main.py --cfg cfg/flowers_3stages.yml --gpu 0

To train the image autoencoder birds dataset

python main.py --cfg cfg/birds_3stages.yml --gpu 0

Models will automatically saved after a fixed number of iteration, to restart from a failed step edit netG_version in respective .yml file

Training the text autoencoder

python run_text_test.py dataset_type Input_Folder output_file.txt

For Flower Dataset dataset_type=1, for Birds Dataset dataset_type=2 e.g.

python run_text_test.py 2 /home/user/dev/unsup/data_datasets/CUB_200_2011 outbirds_n.txt

Training the mapping networks

Train the GAN-based mapping network

python MappingImageText.py Dataset_folder

e.g.

python MappingImageText.py /home/user/dev/unsup/data_datasets/CUB_200_2011

Train the MMD-based mapping network

python mmd_ganTI.py --dataset /home/das/dev/data_datasets/birds_dataset/CUB_200_2011 --gpu_device 0

python mmd_ganIT.py --dataset /home/das/dev/data_datasets/birds_dataset/CUB_200_2011 --gpu_device 0

Official implementation of Self-supervised Image-to-text and Text-to-image Synthesis

Related tags

Overview

Self-supervised Image-to-text and Text-to-image Synthesis

Dataset

Dependencies

Training

Training the image autoencoder

To train the image autoencoder on flower dataset

To train the image autoencoder birds dataset

Training the text autoencoder

Training the mapping networks

Train the GAN-based mapping network

Train the MMD-based mapping network

Owner

TICC is a python solver for efficiently segmenting and clustering a multivariate time series

Sequence modeling benchmarks and temporal convolutional networks

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

A Comprehensive Analysis of Weakly-Supervised Semantic Segmentation in Different Image Domains (IJCV submission)

Voice assistant - Voice assistant with python

A python script to lookup Passport Index Dataset

Block Sparse movement pruning

SnapMix: Semantically Proportional Mixing for Augmenting Fine-grained Data (AAAI 2021)

Supplementary code for SIGGRAPH 2021 paper: Discovering Diverse Athletic Jumping Strategies

[NeurIPS 2021] A weak-shot object detection approach by transferring semantic similarity and mask prior.

Official implementation of the paper DeFlow: Learning Complex Image Degradations from Unpaired Data with Conditional Flows

Quadruped-command-tracking-controller - Quadruped command tracking controller (flat terrain)

GeDML is an easy-to-use generalized deep metric learning library

Library for fast text representation and classification.

OpenMMLab Pose Estimation Toolbox and Benchmark.

Static Features Classifier - A static features classifier for Point-Could clusters using an Attention-RNN model

[CIKM 2021] Enhancing Aspect-Based Sentiment Analysis with Supervised Contrastive Learning

Cascading Feature Extraction for Fast Point Cloud Registration (BMVC 2021)

Manim is an engine for precise programmatic animations, designed for creating explanatory math videos

Retinal vessel segmentation based on GT-UNet