The official implementation of paper Siamese Transformer Pyramid Networks for Real-Time UAV Tracking, accepted by WACV22

Last update: Jan 08, 2023

Related tags

Deep Learning SiamTPNTracker

Overview

SiamTPN

Introduction

This is the official implementation of the SiamTPN (WACV2022). The tracker intergrates pyramid feature network and transformer into Siamese network, achieving state-of-the-art performance (better than DiMP) while runing 30 FPS on a single CPU. The tracker optimized with ONXX and openvino could run at 45 FPS on cpu end, leading promising performance when deploying on drones for tracking.

[Paper] [Raw Results] [Drone Tracking Videos] [Models]

Training

prepare data

change the path in lib/train/admin/local.py to your data location

# Distributed training withh 4 nodes 
python -m torch.distributed.launch --nproc_per_node 4 tools/run_training.py --config shufflenet_l345_192

# single gpu training for test purpose
python tools/run_training.py --config shufflenet_l345_192

Test and evaluate SiamTPN

prepare data

change the path in lib/test/evaluation/local.py to your data location

running on cpu

# Download the pretrain model and put it under ./results/checkpoints/train/SiamTPN/ folder

python tools/test.py siamtpn shufflenet_l345_192 --dataset_name got10k_val --debug 1 --cpu 1 --epoch 100 --sequence GOT-10k_Val_000001

running on cpu with onnx optimized

The debug mode will show tracking results, more details refer to tools/test.py

Currently, onnx only support cpu version

First, you need to install onxx and onxxruningtime:

pip install onxx
# for onxx runining time, download the openvino version from release [page](https://github.com/intel/onnxruntime/releases/tag/v3.1) and install with
pip install onnxruntime_openvino-1.9.0-cp37-cp37m-linux_x86_64.whl

# please refer the [page](https://github.com/intel/onnxruntime/releases/tag/v3.1) for openvino installation details.

# Download the converted onnx model and put it under ./results/onnx/ folder
# or conver your own model with 
python tools/onnx_search.py
python tools/onnx_template.py

python tools/test.py siamtpn_onnx shufflenet_l345_192 --dataset_name got10k_val --debug 1 --cpu 1 --epoch 100 --sequence GOT-10k_Val_000001

Citation

Acknowledge

Our code is implemented based on the following libraries:

The official implementation of paper Siamese Transformer Pyramid Networks for Real-Time UAV Tracking, accepted by WACV22

Related tags

Overview

SiamTPN

Introduction

Training

prepare data

Test and evaluate SiamTPN

prepare data

running on cpu

running on cpu with onnx optimized

Citation

Acknowledge

Owner

Robotics and Intelligent Systems Control @ NYUAD

An end-to-end PyTorch framework for image and video classification

An implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch

Official implementation for the paper: Permutation Invariant Graph Generation via Score-Based Generative Modeling

A U-Net combined with a variational auto-encoder that is able to learn conditional distributions over semantic segmentations.

Generating Band-Limited Adversarial Surfaces Using Neural Networks

Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation

This repository contains FEDOT - an open-source framework for automated modeling and machine learning (AutoML)

Rethinking Transformer-based Set Prediction for Object Detection

An energy estimator for eyeriss-like DNN hardware accelerator

HTSeq is a Python library to facilitate processing and analysis of data from high-throughput sequencing (HTS) experiments.

😊 Python module for face feature changing

Face recognize and crop them

A new test set for ImageNet

Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021

Deep Learning for Time Series Classification

Modeling CNN layers activity with Gaussian mixture model

[ICCV 2021 Oral] PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers

MASS (Mueen's Algorithm for Similarity Search) - a python 2 and 3 compatible library used for searching time series sub-sequences under z-normalized Euclidean distance for similarity.

Pytoydl: A toy deep learning framework built upon numpy.

Text Extraction Formulation + Feedback Loop for state-of-the-art WSD (EMNLP 2021)