Code release for the ICML 2021 paper "PixelTransformer: Sample Conditioned Signal Generation".

Last update: Dec 17, 2022

PixelTransformer

Project Page

Installation

Please install PyTorch and PyTorch3D before running the following steps.

pip install hydra-core --upgrade
pip install pytorch-lightning
pip install imageio scikit-image

mkdir external; cd external;
git clone git@github.com:kuangliu/pytorch-cifar.git
# To evaluate CIFAR classification accuracy, train a ResNet-18 model from this repo.

Please modify the paths in the config files to match your local setup.
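
Before training, it can help to confirm that the prerequisites and the pip packages listed above are importable in the active environment. The following is a minimal sketch, not part of the released code: the helper name `find_missing` and the `REQUIRED_MODULES` mapping are hypothetical, and the import names are assumed to be the usual ones for each package.

```python
import importlib.util

# Assumed mapping from import name to the package name used in this README.
# This table is illustrative; adjust it if your environment differs.
REQUIRED_MODULES = {
    "torch": "pytorch",
    "pytorch3d": "pytorch3d",
    "hydra": "hydra-core",
    "pytorch_lightning": "pytorch-lightning",
    "imageio": "imageio",
    "skimage": "scikit-image",
}


def find_missing(modules=REQUIRED_MODULES):
    """Return the package names whose import module cannot be located."""
    missing = []
    for module_name, package_name in modules.items():
        # find_spec returns None when a top-level module is not installed.
        if importlib.util.find_spec(module_name) is None:
            missing.append(package_name)
    return missing


if __name__ == "__main__":
    missing = find_missing()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All dependencies found.")
```

The check only locates each module without importing it, so it stays fast and does not trigger GPU initialization.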

Training

See the sample commands in experiments/s2s.py

Evaluating

See the sample commands in benchmark/

Preprocessing Data

Most of the image datasets used correspond to standard torchvision datasets. The cat dataset is from Wu et al.'s CVPR 2020 work and can be downloaded using their provided script.

To extract SDF values for the ShapeNet experiments, we followed the preprocessing steps from DISN, with some modifications to the extraction file. For reproducibility, please use our modified preprocessing file instead.

Owner

Shubham Tulsiani
