Code for "CloudAAE: Learning 6D Object Pose Regression with On-line Data Synthesis on Point Clouds" @ICRA2021

Last update: Nov 14, 2022

Related tags

Overview

CloudAAE

This is an tensorflow implementation of "CloudAAE: Learning 6D Object Pose Regression with On-line Data Synthesis on Point Clouds"

Files

log: directory to store log files during training.
losses: loss functions for training.
models: a python file defining model structure.
object_model_tfrecord: full object models for data synthesizing and visualization purpose.
tf_ops: tensorflow implementation of sampling operations (credit: Haoqiang Fan, Charles R. Qi).
trained_network: a trained network.
utils: utility files for defining model structure.
ycb_video_data_tfRecords: synthetic training data and real test data for the YCB video dataset.
evaluate_cloudAAE_ycbv.py: script for testing object 6d pose estimation with a trained network on test set in YCB video dataset.
train_cloudAAE_ycbv.py: script for training a network on synthetic data for YCB objects.

Requirements

Tensorflow-GPU (tested with 1.12.0)
transforms3d
open3d for visualization

Test a trained network

Testing data in tfrecord format is available

Download zip file
Unzip and place all files in ycb_video_data_tfRecords/test_real/

After activate tensorflow

python evaluate_cloudAAE_ycbv.py --trained_model trained_network/20200908-204328/model.ckpt --batch_size 1 --target_cls 0

--trained_model: directory to trained model (*.ckpt).
--batch_size: 1.
--target_class: target class for pose estimation.
Translation prediction is in unit meter.
Rotation prediction is in axis-angle format.

Result

If you turn on visualization with b_visual=True, you will see the following displays which are partially observed point cloud segments (red) overlaid with object model (green) with pose estimates. The reconstructed point cloud is also presented (blue).
The coordinate is the object coordinate, object segment is viewed in the camera coordinate

Train a network

Training data is created synthetically using 3D object model and 6D poses.

The 6D pose and class id of target object are in ycb_video_data_tfRecords/train_syn/
The data synthesis pipeline takes the target 3D object model and creates a segment of the object in the desired 6D pose. Below is two examples of synthetic segment (red), two real segments (red) are also shown for comparison.

Run script

python train_cloudAAE_ycbv.py

Log files and trained model is store in log

Citation

If you use this code in an academic context, please consider cite the paper:

BiBTeX:

@inproceedings{gao2020cloudpose,
      title={CloudAAE: Learning 6D Object Pose Regression with On-line Data
Synthesis on Point Clouds},
      author={G. Gao, M. Lauri, X. Hu, J. Zhang and S. Frintrop},
      booktitle={ICRA},
      year={2021}
    }

Link to Paper

TBA

Acknowledgement

The building block for this system is PointNet and Dynamic Graph.

Code for "CloudAAE: Learning 6D Object Pose Regression with On-line Data Synthesis on Point Clouds" @ICRA2021

Related tags

Overview

CloudAAE

Files

Requirements

Test a trained network

Train a network

Citation

Link to Paper

Acknowledgement

Owner

Gee

上海交通大学全自动抢课脚本，支持准点开抢与抢课后持续捡漏两种模式。2021/06/08更新。

Code release for the ICML 2021 paper "PixelTransformer: Sample Conditioned Signal Generation".

Text-to-Image generation

Reinforcement learning for self-driving in a 3D simulation

CLIP + VQGAN / PixelDraw

Reviving Iterative Training with Mask Guidance for Interactive Segmentation

Official code of the paper "ReDet: A Rotation-equivariant Detector for Aerial Object Detection" (CVPR 2021)

Boundary-aware Transformers for Skin Lesion Segmentation

Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)

Provide partial dates and retain the date precision through processing

Improving Object Detection by Estimating Bounding Box Quality Accurately

Code for testing various M1 Chip benchmarks with TensorFlow.

DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

Pytorch-Swin-Unet-V2 - a modified version of Swin Unet based on Swin Transfomer V2

Kaggle-titanic - A tutorial for Kaggle's Titanic: Machine Learning from Disaster competition. Demonstrates basic data munging, analysis, and visualization techniques. Shows examples of supervised machine learning techniques.

Official repository for CVPR21 paper "Deep Stable Learning for Out-Of-Distribution Generalization".

Json2Xml tool will help you convert from json COCO format to VOC xml format in Object Detection Problem.

SOTR: Segmenting Objects with Transformers [ICCV 2021]

Code for our WACV 2022 paper "Hyper-Convolution Networks for Biomedical Image Segmentation"

DCGAN LSGAN WGAN-GP DRAGAN PyTorch