SceneFormer: Indoor Scene Generation with Transformers

Initial code release for the Sceneformer paper, contains models, train and test scripts for the shape conditioned model. Text conditioned model and detailed README coming soon.

Please also check the project website here

Setup

Install the requirements in requirements.txt and environment.yaml in a conda environment. Packages that are common can be installed either through pip or conda.

Prepare Data

The SUNCG dataset is currently not available, hence all related files have been removed. The dataset can be prepared with the scripts which were taken from deepsynth.

Train

Configure the experiment in configs/scene_shift_X_config.yaml where X is one of cat, dim, loc, ori

Then run

python scene_scripts/train_shift_X_lt.py configs/scene_shift_X_config.yaml

to train the model X.

Test

Configure the model paths in scene_scripts/test.py and then run

python scene_scripts/test.py

If you find our work useful, please consider citing us:

@article{wang2020sceneformer,
  title={SceneFormer: Indoor Scene Generation with Transformers},
  author={Wang, Xinpeng and Yeshwanth, Chandan and Nie{\ss}ner, Matthias},
  journal={arXiv preprint arXiv:2012.09793},
  year={2020}
}

Generate indoor scenes with Transformers

Related tags

Overview

SceneFormer: Indoor Scene Generation with Transformers

Setup

Prepare Data

Train

Test

Owner

Chandan Yeshwanth

Using deep learning to predict gene structures of the coding genes in DNA sequences of Arabidopsis thaliana

DECAF: Deep Extreme Classification with Label Features

CS50x-AI - Artificial Intelligence with Python from Harvard University

Official implementation of Influence-balanced Loss for Imbalanced Visual Classification in PyTorch.

Pytorch implementation of Deep Recursive Residual Network for Super Resolution (DRRN)

Processed, version controlled history of Minecraft's generated data and assets

Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)

Recognize numbers from an (28 x 28) image using neural networks

Kaggle Feedback Prize - Evaluating Student Writing 15th solution

potpourri3d - An invigorating blend of 3D geometry tools in Python.

SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

🧠 A PyTorch implementation of 'Deep CORAL: Correlation Alignment for Deep Domain Adaptation.', ECCV 2016

PyTorch Lightning implementation of Automatic Speech Recognition

SAMO: Streaming Architecture Mapping Optimisation

Code for "Sparse Steerable Convolutions: An Efficient Learning of SE(3)-Equivariant Features for Estimation and Tracking of Object Poses in 3D Space"

Dataset VSD4K includes 6 popular categories: game, sport, dance, vlog, interview and city.

Official codebase used to develop Vision Transformer, MLP-Mixer, LiT and more.

Pytorch implementation of the paper "Optimization as a Model for Few-Shot Learning"

Implementation of the method described in the Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.

PyTorch implementation of the paper: "Preference-Adaptive Meta-Learning for Cold-Start Recommendation", IJCAI, 2021.