STMTrack: Template-free Visual Tracking with Space-time Memory Networks

Last update: Dec 21, 2022

STMTrack

This is the official implementation of the paper: STMTrack: Template-free Visual Tracking with Space-time Memory Networks.

Setup

Prepare Anaconda, CUDA, and the corresponding toolkits. CUDA 10.0 or later is required.
Create a new conda environment and activate it.

conda create -n STMTrack python=3.7 -y
conda activate STMTrack

Install PyTorch and torchvision.

conda install pytorch==1.4.0 torchvision==0.5.0 cudatoolkit=10.0 -c pytorch
# PyTorch v1.5.0, v1.6.0, or later should also work.
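After installing, it can help to confirm that the environment actually sees PyTorch and a CUDA device before moving on. The following is a minimal sketch, not part of the repository; the function name `describe_torch_install` is hypothetical, and it only uses standard PyTorch attributes.

```python
import importlib.util


def describe_torch_install():
    """Return a small report on the local PyTorch install, if any."""
    report = {"installed": importlib.util.find_spec("torch") is not None}
    if not report["installed"]:
        return report

    import torch  # imported lazily so the script still runs without PyTorch

    report["torch_version"] = torch.__version__
    report["cuda_available"] = torch.cuda.is_available()
    # torch.version.cuda is None for CPU-only builds
    report["cuda_build"] = torch.version.cuda
    return report


if __name__ == "__main__":
    for key, value in describe_torch_install().items():
        print(f"{key}: {value}")
```

If `cuda_available` is reported as False, the installed cudatoolkit version and the GPU driver are the first things to check.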

Install other required packages.

pip install -r requirements.txt

Test

Prepare the datasets: OTB2015, VOT2018, UAV123, GOT-10k, TrackingNet, LaSOT, ILSVRC VID*, ILSVRC DET*, COCO*, and any others you want to test on. Set up the paths as follows:

├── STMTrack
|   ├── ...
|   ├── ...
|   ├── datasets
|   |   ├── COCO -> /opt/data/COCO
|   |   ├── GOT-10k -> /opt/data/GOT-10k
|   |   ├── ILSVRC2015 -> /opt/data/ILSVRC2015
|   |   ├── LaSOT -> /opt/data/LaSOT/LaSOTBenchmark
|   |   ├── OTB
|   |   |   └── OTB2015 -> /opt/data/OTB2015
|   |   ├── TrackingNet -> /opt/data/TrackingNet
|   |   ├── UAV123 -> /opt/data/UAV123/UAV123
|   |   ├── VOT
|   |   |   ├── vot2018
|   |   |   |   ├── VOT2018 -> /opt/data/VOT2018
|   |   |   |   └── VOT2018.json

Notes

i. Star notation (*): used for training only. You can ignore these datasets if you only want to test the tracker.

ii. In this example, we create soft links for every dataset. All datasets are actually stored under /opt/data/. Change the link targets to match your setup.

iii. The VOT2018.json file can be downloaded from here.
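The soft-link layout above can be created by hand with `ln -s`, or scripted. Below is a minimal sketch, assuming the same /opt/data/ locations shown in the tree; the mapping `DATASET_LINKS` and the function `link_datasets` are hypothetical helpers, not part of the repository. It does not fetch VOT2018.json, which still has to be placed in datasets/VOT/vot2018/ manually.

```python
import os
from pathlib import Path

# Link name (relative to STMTrack/datasets) -> real location, copied from the tree above.
# Adjust the targets if your datasets live somewhere other than /opt/data/.
DATASET_LINKS = {
    "COCO": "/opt/data/COCO",
    "GOT-10k": "/opt/data/GOT-10k",
    "ILSVRC2015": "/opt/data/ILSVRC2015",
    "LaSOT": "/opt/data/LaSOT/LaSOTBenchmark",
    "OTB/OTB2015": "/opt/data/OTB2015",
    "TrackingNet": "/opt/data/TrackingNet",
    "UAV123": "/opt/data/UAV123/UAV123",
    "VOT/vot2018/VOT2018": "/opt/data/VOT2018",
}


def link_datasets(repo_root, links=DATASET_LINKS):
    """Create the soft links under <repo_root>/datasets and return the new link paths."""
    created = []
    for name, target in links.items():
        link = Path(repo_root) / "datasets" / name
        link.parent.mkdir(parents=True, exist_ok=True)
        if link.is_symlink() or link.exists():
            continue  # leave anything that is already there untouched
        os.symlink(target, link, target_is_directory=True)
        created.append(link)
    return created
```

Calling `link_datasets(".")` from the STMTrack directory creates any links that are missing and skips existing ones, so it is safe to run more than once.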

Download the models we trained.

📎 GOT-10k model 📎 fulldata model
Set the pretrain_model_path item in the configuration file to the path of the downloaded model, then run the shell command.
Note that all paths used here are relative, not absolute. See any configuration file in the experiments directory for examples and details.
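A wrong pretrain_model_path is an easy mistake to make, so a quick check before launching a run can save time. The sketch below is an illustration only: it scans the configuration text for `pretrain_model_path:` lines rather than parsing the YAML, it assumes the relative paths are resolved against the repository root, and the helper names `find_pretrain_paths` and `missing_pretrain_paths` are hypothetical.

```python
import re
from pathlib import Path

# Matches lines such as `pretrain_model_path: "some/model.pkl"` at any indentation.
_KEY_PATTERN = re.compile(r"^\s*pretrain_model_path\s*:\s*(.*?)\s*$")


def find_pretrain_paths(config_text):
    """Return every non-empty pretrain_model_path value found in the config text."""
    paths = []
    for line in config_text.splitlines():
        line = line.split("#", 1)[0]  # drop trailing YAML comments
        match = _KEY_PATTERN.match(line)
        if match:
            value = match.group(1).strip("'\"")
            if value:
                paths.append(value)
    return paths


def missing_pretrain_paths(config_file, repo_root="."):
    """List configured model paths that do not exist relative to repo_root (an assumption)."""
    text = Path(config_file).read_text(encoding="utf-8")
    return [p for p in find_pretrain_paths(text) if not (Path(repo_root) / p).exists()]
```

For example, `missing_pretrain_paths("experiments/stmtrack/test/got10k/stmtrack-googlenet-got.yaml")` returns an empty list when every configured model file is in place.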

General command format

python main/test.py --config testing_dataset_config_file_path

Take GOT-10k as an example:

python main/test.py --config experiments/stmtrack/test/got10k/stmtrack-googlenet-got.yaml

Training

Prepare the datasets as described in the Test section.
Download the pretrained backbone model from here.
Run the shell command.

Training based on the GOT-10k benchmark

python main/train.py --config experiments/stmtrack/train/got10k/stmtrack-googlenet-trn.yaml

Training with full data

python main/train.py --config experiments/stmtrack/train/fulldata/stmtrack-googlenet-trn-fulldata.yaml
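The test and training commands share the same shape, `python main/<stage>.py --config <file>`, so they are easy to drive from a script when several configurations need to run in sequence. This is a minimal sketch under that assumption; `build_command` and `run_stage` are hypothetical helpers, and only the `--config` flag shown above is used.

```python
import subprocess
import sys


def build_command(stage, config_path):
    """Build the argument list for main/test.py or main/train.py."""
    if stage not in ("test", "train"):
        raise ValueError(f"unknown stage: {stage!r}")
    # sys.executable keeps the run inside the currently active conda environment.
    return [sys.executable, f"main/{stage}.py", "--config", config_path]


def run_stage(stage, config_path, repo_root="."):
    """Run one stage from the repository root and raise if it exits with an error."""
    return subprocess.run(build_command(stage, config_path), cwd=repo_root, check=True)
```

For instance, `run_stage("train", "experiments/stmtrack/train/got10k/stmtrack-googlenet-trn.yaml")` followed by `run_stage("test", "experiments/stmtrack/test/got10k/stmtrack-googlenet-got.yaml")` trains on GOT-10k and then evaluates, stopping at the first failure.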

Testing Results

Click here to download all of the following results.

OTB2015
GOT-10k
LaSOT
TrackingNet
UAV123
TNL2K
- evaluated by @Xiao Wang.
- The results can be downloaded from Google Drive. See issue #2 for more details.

Acknowledgement

Repository

This repository is developed based on the single object tracking framework video_analyst. See it for more instructions and details.

References

@inproceedings{fu2021stmtrack,
  title={STMTrack: Template-free Visual Tracking with Space-time Memory Networks},
  author={Fu, Zhihong and Liu, Qingjie and Fu, Zehua and Wang, Yunhong},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={13774--13783},
  year={2021}
}

Contact

Zhihong Fu (@fzh0917)

If you have any questions, feel free to open an issue or email me 😄.
