SMIS - Semantically Multi-modal Image Synthesis(CVPR 2020)

Last update: Dec 01, 2022

Related tags

Deep Learning SMIS

Overview

Semantically Multi-modal Image Synthesis

Project page / Paper / Demo

Semantically Multi-modal Image Synthesis(CVPR2020).
Zhen Zhu, Zhiliang Xu, Ansheng You, Xiang Bai

Requirements

torch>=1.0.0
torchvision
dominate
dill
scikit-image
tqdm
opencv-python

Getting Started

Data Preperation

DeepFashion
Note: We provide an example of the DeepFashion dataset. That is slightly different from the DeepFashion used in our paper due to the impact of the COVID-19.

Cityscapes
The Cityscapes dataset can be downloaded at here

ADE20K
The ADE20K dataset can be downloaded at here

Test/Train the models

Download the tar of the pretrained models from the Google Drive Folder. Save it in checkpoints/ and unzip it. There are deepfashion.sh, cityscapes.sh and ade20k.sh in the scripts folder. Change the parameters like --dataroot and so on, then comment or uncomment some code to test/train model. And you can specify the --test_mask for SMIS test.

Acknowledgments

Our code is based on the popular SPADE

SMIS - Semantically Multi-modal Image Synthesis(CVPR 2020)

Related tags

Overview

Semantically Multi-modal Image Synthesis

Project page / Paper / Demo

Requirements

Getting Started

Data Preperation

Test/Train the models

Acknowledgments

Owner

A Fast Sequence Transducer Implementation with PyTorch Bindings

Evaluation Pipeline for our ECCV2020: Journey Towards Tiny Perceptual Super-Resolution.

Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."

PoolFormer: MetaFormer is Actually What You Need for Vision

Multi agent DDPG algorithm written in Python + Pytorch

Cascaded Pyramid Network (CPN) based on Keras (Tensorflow backend)

Learning Open-World Object Proposals without Learning to Classify

EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling

imbalanced-DL: Deep Imbalanced Learning in Python

Companion code for "Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees"

Hunt down social media accounts by username across social networks

Introducing neural networks to predict stock prices

Apply a perspective transformation to a raster image inside Inkscape (no need to use an external software such as GIMP or Krita).

A transformer model to predict pathogenic mutations

PyDEns is a framework for solving Ordinary and Partial Differential Equations (ODEs & PDEs) using neural networks

PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks

Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.

Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"

Implements MLP-Mixer: An all-MLP Architecture for Vision.

Jremesh-tools - Blender addon for quad remeshing