Learning 3D Part Assembly from a Single Image

Last update: Dec 21, 2022

Related tags

Overview

Learning 3D Part Assembly from a Single Image

This repository contains a PyTorch implementation of the paper:

Learning 3D Part Assembly from A Single Image.
Yichen Li*, Kaichun Mo*, Lin Shao, Minhyuk Sung, Leonidas Guibas,
ECCV 2020

Introduction

Autonomous assembly is a crucial capability for robots in many applications. For this task, several problems such as obstacle avoidance, motion planning, and actuator control have been extensively studied in robotics. However, when it comes to task specification, the space of possibilities remains underexplored. Towards this end, we introduce a novel problem, single-image-guided 3D part assembly, along with a learningbased solution. We study this problem in the setting of furniture assembly from a given complete set of parts and a single image depicting the entire assembled object. Multiple challenges exist in this setting, including handling ambiguity among parts (e.g., slats in a chair back and leg stretchers) and 3D pose prediction for parts and part subassemblies, whether visible or occluded. We address these issues by proposing a two-module pipeline that leverages strong 2D-3D correspondences and assembly-oriented graph message-passing to infer part relationships. In experiments with a PartNet-based synthetic benchmark, we demonstrate the effectiveness of our framework as compared with three baseline approaches.

Dependencies

Python 3.6
CUDA 10.0.
PyTorch. code tested with version 1.3.1
Blender. for visualization of results 2.7.9
(Optional) Tensorboard for visualization of the training process.
For the project it has been used TensorboardX

pip install -r requirements.txt

Chamfer Distance

cd exps/utils/cd
python setup.py install

Dataset

Data is available here: link.

wget http://download.cs.stanford.edu/orion/impartass/assembly_data.zip

Training

Training the segmentation stage first

cd exps/exp_segmentation
sh train.sh

modify your parameters including data_path, exp_name and etc. (see closed issues for details info)

Training the assembly stage

cd exps/exp_assemble
sh train.sh

Pre-trained models

Pretrained weights for the chair category is available at link.

wget http://download.cs.stanford.edu/orion/impartass/chair_weights.zip

Cite

Please cite our work if you find it useful:

@article{li2020impartass,
    title={Learning 3D Part Assembly from a Single Image},
    author={Li, Yichen and Mo, Kaichun and Shao, Lin and Sung, Minghyuk and Guibas, Leonidas},
    journal={European conference on computer vision (ECCV 2020)},
    year={2020}
}

Learning 3D Part Assembly from a Single Image

Related tags

Overview

Learning 3D Part Assembly from a Single Image

Introduction

Dependencies

Dataset

Training

Training the segmentation stage first

Training the assembly stage

Pre-trained models

Cite

Owner

Torch-based tool for quantizing high-dimensional vectors using additive codebooks

Compressed Video Action Recognition

Face recognition system using MTCNN, FACENET, SVM and FAST API to track participants of Big Brother Brasil in real time.

Code for the CVPR2021 paper "Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition"

A variational Bayesian method for similarity learning in non-rigid image registration (CVPR 2022)

Multiview 3D object detection on MultiviewC dataset through moft3d.

Pytorch implementation of the paper "Class-Balanced Loss Based on Effective Number of Samples"

git git《Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking》(CVPR 2021) GitHub:git2] 《Masksembles for Uncertainty Estimation》(CVPR 2021) GitHub:git3]

Official implementation of the paper Label-Efficient Semantic Segmentation with Diffusion Models

Code for testing various M1 Chip benchmarks with TensorFlow.

Self-Supervised Image Denoising via Iterative Data Refinement

PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations

novel deep learning research works with PaddlePaddle

FastFCN: Rethinking Dilated Convolution in the Backbone for Semantic Segmentation.

Machine learning evaluation metrics, implemented in Python, R, Haskell, and MATLAB / Octave

Machine Learning automation and tracking

Scalable Graph Neural Networks for Heterogeneous Graphs

Strongly local p-norm-cut algorithms for semi-supervised learning and local graph clustering

Use evolutionary algorithms instead of gridsearch in scikit-learn

Quantized models with python