Implementation of paper "DeepTag: A General Framework for Fiducial Marker Design and Detection"

Last update: Dec 12, 2022

Related tags

Deep Learning deeptag-pytorch

Overview

Implementation of paper DeepTag: A General Framework for Fiducial Marker Design and Detection.

Project page: https://herohuyongtao.github.io/research/publications/deep-tag/.

Overview

DeepTag is a general framework for fiducial marker design and detection, which supports existing and newly-designed marker families. DeepTag is a two-stage marker detection pipeline:

Stage-1: detect ROIs of potential markers;
Stage-2: detect keypoints and digital symbols inside each ROI, then determine 6-DoF pose and marker ID.

How to run

For image input:

python test_deeptag.py --config config_image.json

For video input:

python test_deeptag.py --config config_video.json

The configuration file is in JSON format. Please modify the configurations to fit your needs. Example configurations files for image and video input are provided (i.e., config_image.json and config_video.json).

Detail explaination of configuration file:

is_video: {0, 1} for image/video respectively.
filepath: path of input image/video (use 0 for webcam input).
family: marker family, currently support {apriltag, aruco, artoolkitplus, runetag, topotag, apriltagxo}.
hamming_dist: Hamming dist for checking the marker library; normally, 4 works well enough.
codebook: path of codebook; if it is empty, the default path codebook/FAMILY_codebook.txt will be used. For markers with multiple codebooks like AprilTag and ArUco, their default codebooks are for AprilTag (36h11) and ArUco (36h12) respectively.
cameraMatrix: camera intrinsic matrix, [fx, 0, cx, 0, fy, cy, 0, 0, 1].
distCoeffs: camera distortion coefficients (both radial and tangential), [k1, k2, p1, p2, k3, k4, k5, k6].
marker_size: physical size of the marker.

Besides supporting existing markers like AprilTag, ArUco, ARToolkitPlus, TopoTag & RuneTag, DeepTag also supports newly-designed markers like AprilTag-XO, AprilTag-XA and RuneTag+ (provided in folders images_tag). Set family to apriltagxo in config for AprilTag-XO and AprilTag-XA, and runetag for RuneTag+ respectively.

Terms of use

The source code is provided for research purposes only. Any commercial use is prohibited. When using the code in your research work, please cite the following paper:

"DeepTag: A General Framework for Fiducial Marker Design and Detection."
Zhuming Zhang, Yongtao Hu, Guoxing Yu, and Jingwen Dai
arXiv:2105.13731 (2021).

@article{zhang2021deeptag,
  title={{DeepTag: A General Framework for Fiducial Marker Design and Detection}},
  author={Zhang, Zhuming and Hu, Yongtao and Yu, Guoxing and Dai, Jingwen},
  year={2021},
  eprint={2105.13731},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

Contact

If you find any bug or have any question about the code, please report to the Issues page.

Implementation of paper "DeepTag: A General Framework for Fiducial Marker Design and Detection"

Related tags

Overview

Overview

How to run

Terms of use

Contact

Owner

Yongtao Hu

HCQ: Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval

Official PyTorch implementation of Retrieve in Style: Unsupervised Facial Feature Transfer and Retrieval.

Pixel-level Crack Detection From Images Of Levee Systems : A Comparative Study

Joint detection and tracking model named DEFT, or ``Detection Embeddings for Tracking.

Repo for WWW 2022 paper: Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval

YOLTv5 rapidly detects objects in arbitrarily large aerial or satellite images that far exceed the ~600×600 pixel size typically ingested by deep learning object detection frameworks

AISTATS 2019: Confidence-based Graph Convolutional Networks for Semi-Supervised Learning

Official code for our ICCV paper: "From Continuity to Editability: Inverting GANs with Consecutive Images"

Implementation of fast algorithms for Maximum Spanning Tree (MST) parsing that includes fast ArcMax+Reweighting+Tarjan algorithm for single-root dependency parsing.

Signals-backend - A suite of card games written in Python

Easy-to-use library to boost AI inference leveraging state-of-the-art optimization techniques.

A new framework, collaborative cascade prediction based on graph neural networks (CCasGNN) to jointly utilize the structural characteristics, sequence features, and user profiles.

Benchmarks for semi-supervised domain generalization.

Implementation of paper "Self-supervised Learning on Graphs:Deep Insights and New Directions"

Simple tutorials on Pytorch DDP training

[CVPR 2022] Official Pytorch code for OW-DETR: Open-world Detection Transformer

Proof-Of-Concept Piano-Drums Music AI Model/Implementation

Little Ball of Fur - A graph sampling extension library for NetworKit and NetworkX (CIKM 2020)

A new play-and-plug method of controlling an existing generative model with conditioning attributes and their compositions.

CVPR2021 Content-Aware GAN Compression