VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Last update: Dec 26, 2022

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

3D-aware Image Synthesis via Learning Structural and Textural Representations
Yinghao Xu, Sida Peng, Ceyuan Yang, Yujun Shen, Bolei Zhou
arXiv preprint arXiv:

[Paper] [Project Page] [Demo]

This paper aims at achieving high-fidelity 3D-aware images synthesis. We propose a novel framework, termed as VolumeGAN, for synthesizing images under different camera views, through explicitly learning a structural representation and a textural representation. We first learn a feature volume to represent the underlying structure, which is then converted to a feature field using a NeRF-like model. The feature field is further accumulated into a 2D feature map as the textural representation, followed by a neural renderer for appearance synthesis. Such a design enables independent control of the shape and the appearance. Extensive experiments on a wide range of datasets show that our approach achieves sufficiently higher image quality and better 3D control than the previous methods.

Qualitative Results

Independent control of structure (shape) and texture (appearance).

Comparison to prior work on various datasets.

Code Coming Soon

BibTeX

@article{xu2021volumegan,
  title   = {3D-aware Image Synthesis via Learning Structural and Textural Representations},
  author  = {Xu, Yinghao and Peng, Sida and Yang, Ceyuan and Shen, Yujun and Zhou, Bolei},
  article = {arXiv preprint arXiv:2112.10759},
  year    = {2021}
}

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Qualitative Results

Code Coming Soon

BibTeX

Owner

GenForce: May Generative Force Be with You

CAMoE + Dual SoftMax Loss (DSL): Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss

Unofficial Implementation of MLP-Mixer in TensorFlow

Global Rhythm Style Transfer Without Text Transcriptions

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Predicting Price of house by considering ,house age, Distance from public transport

Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps[AAAI2021]

Official PyTorch implementation of paper: Standardized Max Logits: A Simple yet Effective Approach for Identifying Unexpected Road Obstacles in Urban-Scene Segmentation (ICCV 2021 Oral Presentation)

Production First and Production Ready End-to-End Speech Recognition Toolkit

Deep Learning for Natural Language Processing SS 2021 (TU Darmstadt)

This repository is for the preprint "A generative nonparametric Bayesian model for whole genomes"

Info and sample codes for "NTU RGB+D Action Recognition Dataset"

Gradient representations in ReLU networks as similarity functions

StrongSORT: Make DeepSORT Great Again

Implementation of the paper NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting.

Gauge equivariant mesh cnn

Implementation of ReSeg using PyTorch

Frequency Domain Image Translation: More Photo-realistic, Better Identity-preserving

Pytorch implementation for the paper: Contrastive Learning for Cold-start Recommendation

GARCH and Multivariate LSTM forecasting models for Bitcoin realized volatility with potential applications in crypto options trading, hedging, portfolio management, and risk management

Unofficial PyTorch implementation of TokenLearner by Google AI