sfp-wild

Implementation for Shape from Polarization for Complex Scenes in the Wild

project website | paper

Code and dataset will be released soon.

Introduction

We present a new data-driven approach with physics-based priors for scene-level normal estimation from a single polarization image. Existing shape from polarization (SfP) work mainly focuses on estimating the normals of a single object rather than of complex scenes in the wild. A key barrier to high-quality scene-level SfP is the lack of real-world SfP data for complex scenes. Hence, we contribute the first real-world scene-level SfP dataset with paired input polarization images and ground-truth normal maps. We then propose a learning-based framework with a multi-head self-attention module and viewing encoding, designed to handle the increased polarization ambiguities caused by complex materials and non-orthographic projection in scene-level SfP. Our trained model generalizes to far-field outdoor scenes, because the relationship between polarized light and surface normals is not affected by distance. Experimental results demonstrate that our approach significantly outperforms existing SfP models on two datasets.
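Since the code is not yet released, the following is only an illustrative sketch of what a "polarization image" typically provides, not the authors' implementation. It assumes a capture with four linear polarizer angles (0, 45, 90, and 135 degrees), which the text above does not specify, and computes the standard linear Stokes components, degree of linear polarization (DoLP), and angle of linear polarization (AoLP) for one pixel. All function names are hypothetical.

```python
import math


def stokes_from_four_angles(i0, i45, i90, i135):
    """Linear Stokes components from intensities measured behind
    polarizers at 0, 45, 90 and 135 degrees (assumed capture setup)."""
    s0 = 0.5 * (i0 + i45 + i90 + i135)  # total intensity
    s1 = i0 - i90                       # horizontal vs. vertical
    s2 = i45 - i135                     # diagonal vs. anti-diagonal
    return s0, s1, s2


def dolp_aolp(s0, s1, s2, eps=1e-8):
    """Degree and angle of linear polarization for one pixel.
    The angle is returned in radians, wrapped to [0, pi)."""
    dolp = math.hypot(s1, s2) / max(s0, eps)
    aolp = (0.5 * math.atan2(s2, s1)) % math.pi
    return dolp, aolp


def simulate_pixel(s0, dolp, aolp):
    """Hypothetical helper: intensities a partially polarized pixel
    would produce behind the four polarizer angles."""
    angles = [math.radians(a) for a in (0, 45, 90, 135)]
    return tuple(
        0.5 * s0 * (1.0 + dolp * math.cos(2.0 * (theta - aolp)))
        for theta in angles
    )


if __name__ == "__main__":
    pixel = simulate_pixel(1.0, 0.6, math.radians(30))
    s0, s1, s2 = stokes_from_four_angles(*pixel)
    print(dolp_aolp(s0, s1, s2))
```

Note that the AoLP is only defined modulo pi, so it cannot by itself fix the azimuth of a surface normal. This is one source of the polarization ambiguities that SfP methods have to resolve.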

Citation

If you find this work useful for your research, please cite:

@article{lei2021shape,
    title={Shape from Polarization for Complex Scenes in the Wild}, 
    author={Chenyang Lei and Chenyang Qi and Jiaxin Xie and Na Fan and Vladlen Koltun and Qifeng Chen},
    year={2021},
    journal={arXiv: 2112.11377},
}

Contact

Please contact us if you have any questions (Chenyang Lei, [email protected]; Chenyang Qi, [email protected]).
