Python and C++ implementation of "MarkerPose: Robust real-time planar target tracking for accurate stereo pose estimation". Accepted at LXCV @ CVPR 2021.

Last update: Nov 18, 2022

Overview

MarkerPose: Robust real-time planar target tracking for accurate stereo pose estimation

This is a PyTorch and LibTorch implementation of MarkerPose, a robust, real-time method for high-accuracy pose estimation based on a planar marker of three circles and a calibrated stereo vision system.

MarkerPose consists of three stages. In the first stage, a SuperPoint-like network estimates the marker points with pixel-level accuracy, together with their IDs, in both views. In the second stage, three square patches, each containing one ellipse of the target, are extracted centered on the rough 2D locations estimated in the first stage. EllipSegNet then segments the contour of each ellipse for sub-pixel centroid estimation in both views. In the last stage, the sub-pixel matches from both views are triangulated for 3D pose estimation. For more details, see our paper.
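The geometric part of the last two stages can be illustrated with a minimal NumPy sketch. It is not the code from this repository: `mask_centroid` and `triangulate` are hypothetical helper names, the mask-weighted centroid is a simple stand-in for whatever centroid estimation the repository applies to the EllipSegNet output, and the camera matrices and the 3D point are made-up values, whereas the real ones would come from the stereo calibration in the dataset.

```python
import numpy as np


def mask_centroid(mask, top_left=(0.0, 0.0)):
    """Sub-pixel centroid (x, y) of a binary or soft mask.

    `top_left` is the (x, y) position of the patch inside the full image,
    so the returned centroid is expressed in full-image coordinates.
    """
    mask = np.asarray(mask, dtype=np.float64)
    total = mask.sum()
    if total <= 0:
        raise ValueError("mask is empty")
    ys, xs = np.mgrid[0:mask.shape[0], 0:mask.shape[1]]
    cx = (xs * mask).sum() / total + top_left[0]
    cy = (ys * mask).sum() / total + top_left[1]
    return np.array([cx, cy])


def triangulate(P1, P2, x1, x2):
    """Linear (DLT) triangulation of one point seen in two views.

    P1, P2 are 3x4 projection matrices; x1, x2 are the matching (x, y)
    image points. Returns the 3D point in the frame used by P1 and P2.
    """
    A = np.stack([
        x1[0] * P1[2] - P1[0],
        x1[1] * P1[2] - P1[1],
        x2[0] * P2[2] - P2[0],
        x2[1] * P2[2] - P2[1],
    ])
    # The homogeneous solution is the right singular vector associated
    # with the smallest singular value of A.
    _, _, Vt = np.linalg.svd(A)
    X = Vt[-1]
    return X[:3] / X[3]


# Synthetic stereo rig (assumed values, for illustration only).
K = np.array([[800.0, 0.0, 320.0],
              [0.0, 800.0, 240.0],
              [0.0, 0.0, 1.0]])
P1 = K @ np.hstack([np.eye(3), np.zeros((3, 1))])
P2 = K @ np.hstack([np.eye(3), np.array([[-0.1], [0.0], [0.0]])])

X_true = np.array([0.05, -0.02, 1.0])
x1 = P1 @ np.append(X_true, 1.0)
x2 = P2 @ np.append(X_true, 1.0)
x1, x2 = x1[:2] / x1[2], x2[:2] / x2[2]

X_est = triangulate(P1, P2, x1, x2)
print(X_est)
```

Applying `triangulate` to each of the three matched circle centers yields three 3D points, from which a marker pose can then be derived.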

Pose estimation example

To run the Python or C++ pose estimation examples, you first need to clone this repository and download the dataset. The dataset contains the stereo calibration parameters, stereo images, and pretrained weights for SuperPoint and EllipSegNet.

Clone this repo: git clone https://github.com/jhacsonmeza/MarkerPose
Download the dataset here.
Move the dataset/ folder to the cloned repo folder: mv path/to/dataset/ MarkerPose/.

The folder structure inside the MarkerPose/ directory should be:

MarkerPose
    ├── C++
    ├── dataset
    ├── figures
    └── Python
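As an optional sanity check before running the examples, a short script along these lines can confirm that the four top-level folders listed above are in place. The helper name `missing_dirs` is hypothetical and not part of the repository, and the script only checks that the folders exist, not what they contain.

```python
from pathlib import Path

# Top-level folders expected inside the cloned repository.
EXPECTED_DIRS = ("C++", "dataset", "figures", "Python")


def missing_dirs(repo_root):
    """Return the expected folder names that are absent under repo_root."""
    root = Path(repo_root)
    return [name for name in EXPECTED_DIRS if not (root / name).is_dir()]


if __name__ == "__main__":
    # Assumes the script is run from the parent of the cloned MarkerPose/ folder.
    missing = missing_dirs("MarkerPose")
    if missing:
        print("Missing folders:", ", ".join(missing))
    else:
        print("Folder structure looks complete.")
```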

To learn how to run the pose estimation examples, see the Python/ folder for the PyTorch version and the C++/ folder for the LibTorch version. The code for training SuperPoint and EllipSegNet is also available in both versions.

Citation

If you find this code useful, please consider citing:

@inproceedings{meza2021markerpose,
  title={MarkerPose: Robust Real-time Planar Target Tracking for Accurate Stereo Pose Estimation},
  author={Meza, Jhacson and Romero, Lenny A and Marrugo, Andres G},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year={2021}
}

Owner

Jhacson Meza
