VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Last update: Dec 26, 2022

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

3D-aware Image Synthesis via Learning Structural and Textural Representations
Yinghao Xu, Sida Peng, Ceyuan Yang, Yujun Shen, Bolei Zhou
arXiv preprint arXiv:

[Paper] [Project Page] [Demo]

This paper aims at achieving high-fidelity 3D-aware images synthesis. We propose a novel framework, termed as VolumeGAN, for synthesizing images under different camera views, through explicitly learning a structural representation and a textural representation. We first learn a feature volume to represent the underlying structure, which is then converted to a feature field using a NeRF-like model. The feature field is further accumulated into a 2D feature map as the textural representation, followed by a neural renderer for appearance synthesis. Such a design enables independent control of the shape and the appearance. Extensive experiments on a wide range of datasets show that our approach achieves sufficiently higher image quality and better 3D control than the previous methods.

Qualitative Results

Independent control of structure (shape) and texture (appearance).

Comparison to prior work on various datasets.

Code Coming Soon

BibTeX

@article{xu2021volumegan,
  title   = {3D-aware Image Synthesis via Learning Structural and Textural Representations},
  author  = {Xu, Yinghao and Peng, Sida and Yang, Ceyuan and Shen, Yujun and Zhou, Bolei},
  article = {arXiv preprint arXiv:2112.10759},
  year    = {2021}
}

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Qualitative Results

Code Coming Soon

BibTeX

Owner

GenForce: May Generative Force Be with You

This is the official PyTorch implementation for "Mesa: A Memory-saving Training Framework for Transformers".

Assessing syntactic abilities of BERT

Code for ACM MM 2020 paper "NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination"

Machine Unlearning with SISA

ML-PersonalWork - Big assignment PersonalWork in Machine Learning, 2021 autumn BUAA.

This repository contains FEDOT - an open-source framework for automated modeling and machine learning (AutoML)

STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech

GT China coal model

Koopman operator identification library in Python

Lane follower: Lane-detector (OpenCV) + Object-detector (YOLO5) + CAN-bus

Companion code for the paper "Meta-Learning the Search Distribution of Black-Box Random Search Based Adversarial Attacks" by Yatsura et al.

Use AI to generate a optimized stock portfolio

Multi-Modal Fingerprint Presentation Attack Detection: Evaluation On A New Dataset

Light-weight network, depth estimation, knowledge distillation, real-time depth estimation, auxiliary data.

Official DGL implementation of "Rethinking High-order Graph Convolutional Networks"

PyTorch implementation for our paper "Deep Facial Synthesis: A New Challenge"

A LiDAR point cloud cluster for panoptic segmentation

CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer

Complete* list of autonomous driving related datasets

PyTorch implementations of Top-N recommendation, collaborative filtering recommenders.