Official PyTorch implementation of Joint Object Detection and Multi-Object Tracking with Graph Neural Networks

Last update: Dec 06, 2022

Related tags

Overview

GSDT

Joint Object Detection and Multi-Object Tracking with Graph Neural Networks

This is the official PyTorch implementation of our paper: "Joint Object Detection and Multi-Object Tracking with Graph Neural Networks". Our project website and video demos are here. If you find our work useful, we'd appreciate you citing our paper as follows:

@article{Wang2020_GSDT, 
author = {Wang, Yongxin and Kitani, Kris and Weng, Xinshuo}, 
journal = {arXiv:2006.13164}, 
title = {{Joint Object Detection and Multi-Object Tracking with Graph Neural Networks}}, 
year = {2020} 
}

Introduction

Object detection and data association are critical components in multi-object tracking (MOT) systems. Despite the fact that the two components are dependent on each other, prior work often designs detection and data association modules separately which are trained with different objectives. As a result, we cannot back-propagate the gradients and optimize the entire MOT system, which leads to sub-optimal performance. To address this issue, recent work simultaneously optimizes detection and data association modules under a joint MOT framework, which has shown improved performance in both modules. In this work, we propose a new instance of joint MOT approach based on Graph Neural Networks (GNNs). The key idea is that GNNs can model relations between variable-sized objects in both the spatial and temporal domains, which is essential for learning discriminative features for detection and data association. Through extensive experiments on the MOT15/16/17/20 datasets, we demonstrate the effectiveness of our GNN-based joint MOT approach and show the state-of-the-art performance for both detection and MOT tasks.

Usage

Dependencies

We recommend using anaconda for managing dependency and environments. You may follow the commands below to setup your environment.

conda create -n dev python=3.6
conda activate dev
pip install -r requirements.txt

We use the PyTorch Geometric package for the implementation of our Graph Neural Network based architecture.

bash install_pyg.sh   # we used CUDA_version=cu101

Build Deformable Convolutional Networks V2 (DCNv2)

cd ./src/lib/models/networks/DCNv2
bash make.sh

To automatically generate output tracking as videos, please install ffmpeg

conda install ffmpeg=4.2.2

Data preperation

We follow the same dataset setup as in JDE. Please refer to their DATA ZOO for data download and preperation.

To prepare 2DMOT15 and MOT20 data, you can directly download from the MOT Challenge website, and format each directory as follows:

MOT15
   |——————images
   |        └——————train
   |        └——————test
   └——————labels_with_ids
            └——————train(empty)
MOT20
   |——————images
   |        └——————train
   |        └——————test
   └——————labels_with_ids
            └——————train(empty)

Then change the seq_root and label_root in src/gen_labels_15.py and src/gen_labels_20.py accordingly, and run:

cd src
python gen_labels_15.py
python gen_labels_20.py

This will generate the desired label format of 2DMOT15 and MOT20. The seqinfo.ini files are required for 2DMOT15 and can be found here [Google], [Baidu],code:8o0w.

Inference

Download and save the pretrained weights for each dataset by following the links below:

Dataset	Model
2DMOT15	model_mot15.pth
MOT17	model_mot17.pth
MOT20	model_mot20.pth

Run one of the following command to reproduce our paper's tracking performance on the MOT Challenge.

cd ./experiments
track_gnn_mot_AGNNConv_RoIAlign_mot15.sh 
track_gnn_mot_AGNNConv_RoIAlign_mot17.sh 
track_gnn_mot_AGNNConv_RoIAlign_mot20.sh

To clarify, currently we directly used the MOT17 results as MOT16 results for submission. That is, our MOT16 and MOT17 results and models are identical.

Training

We are currently in the process of cleaning the training code. We'll release as soon as we can. Stay tuned!

Performance on MOT Challenge

You can refer to MOTChallenge website for performance of our method. For your convenience, we summarize results below:

Dataset	MOTA	IDF1	MT	ML	IDS
2DMOT15	60.7	64.6	47.0%	10.5%	477
MOT16	66.7	69.2	38.6%	19.0%	959
MOT17	66.2	68.7	40.8%	18.3%	3318
MOT20	67.1	67.5	53.1%	13.2%	3133

Acknowledgement

A large part of the code is borrowed from FairMOT. We appreciate their great work!

Official PyTorch implementation of Joint Object Detection and Multi-Object Tracking with Graph Neural Networks

Related tags

Overview

GSDT

Joint Object Detection and Multi-Object Tracking with Graph Neural Networks

Introduction

Usage

Dependencies

Data preperation

Inference

Training

Performance on MOT Challenge

Acknowledgement

Owner

Richard Wang

Implementation of Nyström Self-attention, from the paper Nyströmformer

Metrics to evaluate quality and efficacy of synthetic datasets.

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务

🇰🇷 Text to Image in Korean

Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding

This repository lets you interact with Lean through a REPL.

Torch implementation of SegNet and deconvolutional network

AI Virtual Calculator: This is a simple virtual calculator based on Artificial intelligence.

Dataset and Source code of paper 'Enhancing Keyphrase Extraction from Academic Articles with their Reference Information'.

BASH - Biomechanical Animated Skinned Human

Official PyTorch implementation of "Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning" (AAAI 2021)

🤗 Paper Style Guide

Evaluation toolkit of the informative tracking benchmark comprising 9 scenarios, 180 diverse videos, and new challenges.

Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos

Multivariate Time Series Transformer, public version

Unofficial PyTorch implementation of "RTM3D: Real-time Monocular 3D Detection from Object Keypoints for Autonomous Driving" (ECCV 2020)

Implementation of Learning Gradient Fields for Molecular Conformation Generation (ICML 2021).

Materials for upcoming beginner-friendly PyTorch course (work in progress).

an Evolutionary Algorithm assisted GAN

LONG-TERM SERIES FORECASTING WITH QUERYSELECTOR – EFFICIENT MODEL OF SPARSEATTENTION

Official PyTorch implementation of Joint Object Detection and Multi-Object Tracking with Graph Neural Networks

Related tags

Overview

GSDT

Joint Object Detection and Multi-Object Tracking with Graph Neural Networks

Introduction

Usage

Dependencies

Data preperation

Inference

Training

Performance on MOT Challenge

Acknowledgement

Owner

Richard Wang

Implementation of Nyström Self-attention, from the paper Nyströmformer

Metrics to evaluate quality and efficacy of synthetic datasets.

“英特尔创新大师杯”深度学习挑战赛 赛道3：CCKS2021中文NLP地址相关性任务

🇰🇷 Text to Image in Korean

Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding

This repository lets you interact with Lean through a REPL.

Torch implementation of SegNet and deconvolutional network

AI Virtual Calculator: This is a simple virtual calculator based on Artificial intelligence.

Dataset and Source code of paper 'Enhancing Keyphrase Extraction from Academic Articles with their Reference Information'.

BASH - Biomechanical Animated Skinned Human

Official PyTorch implementation of "Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning" (AAAI 2021)

🤗 Paper Style Guide

Evaluation toolkit of the informative tracking benchmark comprising 9 scenarios, 180 diverse videos, and new challenges.

Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos

Multivariate Time Series Transformer, public version

Unofficial PyTorch implementation of "RTM3D: Real-time Monocular 3D Detection from Object Keypoints for Autonomous Driving" (ECCV 2020)

Implementation of Learning Gradient Fields for Molecular Conformation Generation (ICML 2021).

Materials for upcoming beginner-friendly PyTorch course (work in progress).

an Evolutionary Algorithm assisted GAN

LONG-TERM SERIES FORECASTING WITH QUERYSELECTOR – EFFICIENT MODEL OF SPARSEATTENTION

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务