Pop-Out Motion: 3D-Aware Image Deformation via Learning the Shape Laplacian (CVPR 2022)

Last update: Nov 22, 2022

Jihyun Lee*, Minhyuk Sung*, Hyunjin Kim, Tae-Kyun (T-K) Kim (*: equal contributions)

[Project Page] [Paper] [Video]

We present a framework that can deform an object in a 2D image as it exists in 3D space. While our method leverages 2D-to-3D reconstruction, we argue that reconstruction is not sufficient for realistic deformations due to the vulnerability to topological errors. Thus, we propose to take a supervised learning-based approach to predict the shape Laplacian of the underlying volume of a 3D reconstruction represented as a point cloud. Given the deformation energy calculated using the predicted shape Laplacian and user-defined deformation handles (e.g., keypoints), we obtain bounded biharmonic weights to model plausible handle-based image deformation.

Environment Setup

Clone this repository and install the dependencies specified in requirements.txt.

 git clone https://github.com/jyunlee/Pop-Out-Motion.git
 cd Pop-Out-Motion
 pip install -r requirements.txt

Data Pre-Processing

Training Data

Build the executables from the C++ files in the data_preprocessing directory. After running the commands below, you should have the normalize_bin and calc_l_minv_bin executables.

 cd data_preprocessing
 mkdir build
 cd build
 cmake ..
 make
 cd ..

Clone and build the Manifold repository to obtain the manifold executable.
Clone and build the fTetWild repository to obtain the FloatTetwild_bin executable.
Run preprocess_train_data.py to prepare your training data. This performs (1) shape normalization into a unit bounding sphere, (2) conversion to a volume mesh, and (3) calculation of the cotangent Laplacian and the inverse mass matrix.

 python preprocess_train_data.py
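The normalization step can be pictured with a short NumPy sketch. This is only an illustration, not the code behind normalize_bin: the function name normalize_to_unit_sphere is hypothetical, and it assumes the sphere is centered at the midpoint of the axis-aligned bounding box, which may differ from what the repository does.

```python
import numpy as np


def normalize_to_unit_sphere(points: np.ndarray) -> np.ndarray:
    """Translate and scale an (N, 3) point set so it fits a unit sphere.

    Assumption: the sphere center is the bounding-box midpoint, which is
    not necessarily the center of the minimal bounding sphere.
    """
    points = np.asarray(points, dtype=np.float64)
    # Midpoint of the axis-aligned bounding box.
    center = (points.min(axis=0) + points.max(axis=0)) / 2.0
    centered = points - center
    # Distance of the farthest point from the chosen center.
    radius = np.linalg.norm(centered, axis=1).max()
    if radius == 0.0:
        # Degenerate input (all points identical): nothing to scale.
        return centered
    return centered / radius
```

After this transformation the farthest point lies exactly on the unit sphere, so shapes of different sizes share a common scale.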

Test Data

Build the executables from the C++ files in the data_preprocessing directory. After running the commands below, you should have the normalize_bin executable.

 cd data_preprocessing
 mkdir build
 cd build
 cmake ..
 make
 cd ..

Run preprocess_test_data.py to prepare your test data. This performs (1) shape normalization into a unit bounding sphere and (2) pre-computation of the KNN-Based Point Pair Sampling (KPS).

 python preprocess_test_data.py
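As a rough picture of what sampling point pairs from K nearest neighbors means, here is a minimal brute-force sketch. The function name knn_point_pairs and the output layout are hypothetical, and the actual KPS pre-computation in preprocess_test_data.py may select or store pairs differently.

```python
import numpy as np


def knn_point_pairs(points: np.ndarray, k: int) -> np.ndarray:
    """Return an (N * k, 2) array of index pairs (i, j).

    For every point i, j ranges over its k nearest neighbors (self excluded).
    Brute force with O(N^2) memory, so only suitable for small point clouds.
    """
    points = np.asarray(points, dtype=np.float64)
    n = points.shape[0]
    if not 0 < k < n:
        raise ValueError("k must satisfy 0 < k < number of points")
    # Pairwise squared Euclidean distances.
    diff = points[:, None, :] - points[None, :, :]
    dist2 = np.einsum("ijk,ijk->ij", diff, diff)
    # Exclude each point from its own neighbor list.
    np.fill_diagonal(dist2, np.inf)
    neighbors = np.argsort(dist2, axis=1)[:, :k]
    sources = np.repeat(np.arange(n), k)
    return np.stack([sources, neighbors.ravel()], axis=1)
```

Restricting candidate pairs to near neighbors keeps the number of pairs linear in the number of points instead of quadratic.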

Network Training

Run network/train.py to train your own Laplacian Learning Network.

 cd network
 python train.py

A model pre-trained on the DFAUST dataset is also available here.

Network Inference

Deformation Energy Inference

Given an input image, generate its 3D reconstruction by running PIFu. It is also possible to directly use point cloud data obtained from other sources.
Pre-process the reconstruction (or point cloud) from the previous step as described in the Test Data section above.
Run network/a_inference.py to predict the deformation energy matrix.

 cd network
 python a_inference.py
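For intuition, the sketch below shows how a deformation energy matrix is commonly assembled from a Laplacian L and a diagonal inverse mass matrix in the standard biharmonic formulation, A = L^T M^{-1} L. It assumes dense NumPy arrays and a hypothetical function name; how a_inference.py actually builds and stores its output is not specified here.

```python
import numpy as np


def biharmonic_energy_matrix(laplacian: np.ndarray, inv_mass_diag: np.ndarray) -> np.ndarray:
    """Assemble A = L^T M^{-1} L from a Laplacian and inverse mass entries.

    laplacian:     (N, N) Laplacian matrix L.
    inv_mass_diag: (N,) diagonal entries of the inverse mass matrix M^{-1}.
    """
    L = np.asarray(laplacian, dtype=np.float64)
    m_inv = np.asarray(inv_mass_diag, dtype=np.float64)
    # Scaling the rows of L by m_inv is the same as multiplying by diag(m_inv).
    return L.T @ (m_inv[:, None] * L)
```

With non-negative inverse masses the result is symmetric and positive semi-definite, and a weight function w has energy w^T A w.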

Handle-Based Deformation Weight Calculation

Build the executable from the C++ file in the bbw_calculation directory. After running the commands below, you should have the calc_bbw_bin executable.

 cd bbw_calculation
 mkdir build
 cd build
 cmake ..
 make
 cd ..

(Optional) Run sample_pt_handles.py to obtain deformation control handles sampled by farthest point sampling.
Run calc_bbw_bin to calculate handle-based deformation weights using the predicted deformation energy.

 ./build/calc_bbw_bin <shape_path> <handle_path> <deformation_energy_path> <output_weight_path>
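The optional handle sampling step relies on farthest point sampling. A minimal NumPy sketch of that technique follows; the function name, the start_index parameter, and the return format are assumptions for illustration and do not describe the interface of sample_pt_handles.py.

```python
import numpy as np


def farthest_point_sampling(points: np.ndarray, num_handles: int, start_index: int = 0) -> np.ndarray:
    """Greedily pick indices of num_handles points that are spread far apart.

    Each new point is the one farthest from all points selected so far.
    """
    points = np.asarray(points, dtype=np.float64)
    n = points.shape[0]
    if not 0 < num_handles <= n:
        raise ValueError("num_handles must be between 1 and the number of points")
    selected = np.empty(num_handles, dtype=np.int64)
    selected[0] = start_index
    # Distance from every point to its nearest selected point.
    min_dist = np.linalg.norm(points - points[start_index], axis=1)
    for i in range(1, num_handles):
        selected[i] = int(np.argmax(min_dist))
        dist = np.linalg.norm(points - points[selected[i]], axis=1)
        min_dist = np.minimum(min_dist, dist)
    return selected
```

Because the selection is greedy, the result depends on the starting point; a fixed start_index keeps the sampled handles reproducible.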

Citation

If you find this work useful, please consider citing our paper.

@InProceedings{lee2022popoutmotion,
    author = {Lee, Jihyun and Sung, Minhyuk and Kim, Hyunjin and Kim, Tae-Kyun},
    title = {Pop-Out Motion: 3D-Aware Image Deformation via Learning the Shape Laplacian},
    booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
    year = {2022}
}

Acknowledgements

Parts of our data-preprocessing code are adopted from DeepMetaHandles.
Parts of our network code are adopted from Point-Transformer.

