Official code for 'Robust Siamese Object Tracking for Unmanned Aerial Manipulator' and official introduction to the UAMT100 benchmark

Last update: Dec 18, 2022

Overview

SiamSA: Robust Siamese Object Tracking for Unmanned Aerial Manipulator

Demo video

📹 Our video on YouTube and Bilibili demonstrates the evaluation of SiamSA and four other state-of-the-art trackers on the UAV123@10fps and UAMT100 benchmarks.

📹 Real-world tests of SiamSA on a flying UAM platform, from first- and third-person perspectives, are also included.

UAMT100 benchmark

The UAMT100 benchmark consists of 100 image sequences captured from UAM perspectives. It covers a variety of situations in which a UAM tracks an object in an indoor environment, as required by downstream tasks such as grasping.

16 kinds of objects are involved, and 11 attributes are annotated for each sequence. The figure shows four UAM tracking scenarios in UAMT100, and its histogram shows the distribution of attributes across the benchmark.
For more details, please refer to the benchmark website, which will be released soon.

Environment setup

This code has been tested on Ubuntu 18.04, Python 3.8.3, PyTorch 1.6.0 (torchvision 0.7.0), and CUDA 10.2. Please install the required libraries before running the code:

pip install -r requirements.txt
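
After installing, it can be useful to compare the local environment with the tested one. The following is a minimal sketch, not part of the repository: the helper names and the `TESTED` table are hypothetical, and the package name `torch` is an assumption about how PyTorch is installed locally.

```python
import sys
from importlib import metadata

# Version the README reports testing against. The key is the assumed
# PyPI package name; extend the table as needed.
TESTED = {"torch": "1.6.0"}


def installed_versions(packages):
    """Map each package name to its installed version, or None if missing."""
    found = {}
    for name in packages:
        try:
            found[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            found[name] = None
    return found


def report(tested):
    """Return readable lines comparing tested and installed versions."""
    lines = ["python: running " + ".".join(map(str, sys.version_info[:3]))]
    for name, installed in installed_versions(tested).items():
        status = "not installed" if installed is None else "installed " + installed
        lines.append(f"{name}: tested {tested[name]}, {status}")
    return lines


if __name__ == "__main__":
    print("\n".join(report(TESTED)))
```

A mismatch does not necessarily mean the code will fail; the script only reports what is installed.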

Test

Download the model from Google Drive or BaiduYun (code: v4r0) and put it in the tools/snapshot directory.

Download the testing datasets and put them in the test_dataset directory. To test the tracker on a new dataset, please refer to pysot-toolkit to set up test_dataset.

# --trackername: tracker name, --dataset: dataset name, --snapshot: model path
python test.py \
    --trackername SiamSA \
    --dataset UAV123_10fps \
    --snapshot snapshot/model.pth

The test results will be saved in the results/dataset_name/tracker_name directory.

We provide our test results on Google Drive and BaiduYun (code: v4r1).
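
To check that a test run produced output where expected, the results/dataset_name/tracker_name layout described above can be inspected with a short script. This is a sketch under assumptions: the helper names are hypothetical, and the idea that each sequence yields one `.txt` file follows the usual pysot convention rather than anything stated in this README.

```python
from pathlib import Path


def result_dir(root, dataset_name, tracker_name):
    """Build the results/<dataset_name>/<tracker_name> path."""
    return Path(root) / dataset_name / tracker_name


def list_result_files(root, dataset_name, tracker_name, suffix=".txt"):
    """Return sorted result file names, or an empty list if the folder is missing.

    The per-sequence .txt naming is an assumption, not confirmed by the README.
    """
    folder = result_dir(root, dataset_name, tracker_name)
    if not folder.is_dir():
        return []
    return sorted(p.name for p in folder.iterdir() if p.suffix == suffix)


if __name__ == "__main__":
    files = list_result_files("results", "UAV123_10fps", "SiamSA")
    print(f"{len(files)} result file(s) found")
```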

Train

Prepare training datasets

Download the datasets:

VID
YOUTUBEBB (code: t7j8)
COCO
GOT-10K

Note: train_dataset/dataset_name/readme.md lists the detailed steps for generating the training datasets.

Train a model

To train the SiamSA model, run train.py with the desired configs:

cd tools
python train.py

Evaluation

To evaluate the tracker mentioned above, please put its results into the results directory.

# --tracker_path: result path, --dataset: dataset name, --tracker_prefix: tracker name
python eval.py \
    --tracker_path ./results \
    --dataset UAV123_10fps \
    --tracker_prefix 'model'
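
The internals of eval.py are not described here. As a rough illustration of the two scores that one-pass tracking evaluation commonly reports, success (based on box overlap) and precision (based on center error), the sketch below computes both for a list of predicted and ground-truth boxes. The function names, the (x, y, w, h) box format, and the thresholds of 0.5 overlap and 20 pixels are assumptions for illustration, not taken from the repository.

```python
import math


def iou(box_a, box_b):
    """Intersection over union of two (x, y, w, h) boxes."""
    ax, ay, aw, ah = box_a
    bx, by, bw, bh = box_b
    inter_w = max(0.0, min(ax + aw, bx + bw) - max(ax, bx))
    inter_h = max(0.0, min(ay + ah, by + bh) - max(ay, by))
    inter = inter_w * inter_h
    union = aw * ah + bw * bh - inter
    return inter / union if union > 0 else 0.0


def center_distance(box_a, box_b):
    """Euclidean distance between the centers of two (x, y, w, h) boxes."""
    ax, ay, aw, ah = box_a
    bx, by, bw, bh = box_b
    return math.hypot((ax + aw / 2) - (bx + bw / 2),
                      (ay + ah / 2) - (by + bh / 2))


def success_rate(pred, gt, overlap_threshold=0.5):
    """Fraction of frames whose overlap exceeds the threshold."""
    hits = sum(iou(p, g) > overlap_threshold for p, g in zip(pred, gt))
    return hits / len(gt)


def precision(pred, gt, pixel_threshold=20.0):
    """Fraction of frames whose center error is within the threshold."""
    hits = sum(center_distance(p, g) <= pixel_threshold for p, g in zip(pred, gt))
    return hits / len(gt)
```

Evaluation toolkits typically sweep the overlap threshold and summarize the resulting curve rather than using a single value, so numbers from this sketch are not comparable with reported benchmark scores.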

Contact

If you have any questions, please contact me.

Guangze Zheng

Email: [email protected]

Acknowledgement

The code is implemented based on pysot and SiamAPN. We would like to express our sincere thanks to the contributors.
Besides, we would like to thank Ziang Cao for his advice on the code.
For the UAMT100 benchmark, we appreciate the help from Fuling Lin, Haobo Zuo, and Liangliang Yao.
We would like to thank Kunhan Lu for his advice on TensorRT acceleration.

Owner

Intelligent Vision for Robotics in Complex Environment