ByteTrack(Multi-Object Tracking by Associating Every Detection Box)のPythonでのONNX推論サンプル

Last update: Oct 26, 2022

Overview

ByteTrack-ONNX-Sample

ByteTrack(Multi-Object Tracking by Associating Every Detection Box)のPythonでのONNX推論サンプルです。
ONNXに変換したモデルも同梱しています。
変換自体を試したい方はByteTrack_Convert2ONNX.ipynbを使用ください。
ByteTrack_Convert2ONNX.ipynbはColaboratory上での実行を想定しています。
書き動画はWindowsでの実行例です。

sample_.mp4

Requirement

opencv-python 4.5.3.56 or later
onnx 1.9.0 or later
onnxruntime-gpu 1.9.0 or later
Cython 0.29.24 or later
torch 1.8.1 or later
torchvision 0.9.1 or later
pycocotools 2.0.2 or later
scipy 1.6.3 or later
loguru 0.5.3 or later
thop 0.0.31.post2005241907 or later
lap 0.4.0 or later
cython_bbox 0.1.3 or later

※onnxruntime-gpuはonnxruntimeでも動作しますが、推論時間がかかるためGPUを推奨します
※Windowsでcython_bbox のインストールが失敗する場合は、GitHubからのインストールをお試しください(2021/11/19時点)
pip install -e git+https://github.com/samson-wang/cython_bbox.git#egg=cython-bbox

Demo

デモの実行方法は以下です。

動画：動画に対しByteTrackで追跡した結果を動画出力します

python demo_video_onnx.py

実行時オプション

--use_debug_window
動画書き込み時に書き込みフレームをGUI表示するか否か
デフォルト：指定なし
--model
ByteTrackのONNXモデル格納パス
デフォルト：byte_tracker/model/bytetrack_s.onnx
--video
入力動画の格納パス
デフォルト：sample.mp4
--output_dir
動画出力パス
デフォルト：output
--score_th
人検出のスコア閾値
デフォルト：0.1
--score_th
人検出のNMS閾値
デフォルト：0.7
--input_shape
推論時入力サイズ
デフォルト：608,1088
--with_p6
YOLOXモデルのFPN/PANでp6を含むか否か
デフォルト：指定なし
--track_thresh
追跡時のスコア閾値
デフォルト：0.5
--track_buffer
見失い時に何フレームの間、追跡対象を保持するか
デフォルト：30
--match_thresh
追跡時のマッチングスコア閾値
デフォルト：0.8
--min-box-area
最小のバウンディングボックスのサイズ閾値
デフォルト：10
--mot20
MOT20を使用しているか否か
デフォルト：指定なし

Webカメラ：Webカメラ画像に対しByteTrackで追跡した結果をGUI表示します

python demo_webcam_onnx.py

実行時オプション

--model
ByteTrackのONNXモデル格納パス
デフォルト：byte_tracker/model/bytetrack_s.onnx
--device
カメラデバイス番号の指定
デフォルト：0
--width
カメラキャプチャ時の横幅
デフォルト：960
--height
カメラキャプチャ時の縦幅
デフォルト：540
--score_th
人検出のスコア閾値
デフォルト：0.1
--score_th
人検出のNMS閾値
デフォルト：0.7
--input_shape
推論時入力サイズ
デフォルト：608,1088
--with_p6
YOLOXモデルのFPN/PANでp6を含むか否か
デフォルト：指定なし
--track_thresh
追跡時のスコア閾値
デフォルト：0.5
--track_buffer
見失い時に何フレームの間、追跡対象を保持するか
デフォルト：30
--match_thresh
追跡時のマッチングスコア閾値
デフォルト：0.8
--min-box-area
最小のバウンディングボックスのサイズ閾値
デフォルト：10
--mot20
MOT20を使用しているか否か
デフォルト：指定なし

Reference

ifzhang/ByteTrack

Author

高橋かずひと(https://twitter.com/KzhtTkhs)

License

ByteTrack-ONNX-Sample is under MIT License.

License(Movie)

サンプル動画はNHKクリエイティブ・ライブラリーのイギリスウースターのエルガー像を使用しています。

ByteTrack(Multi-Object Tracking by Associating Every Detection Box)のPythonでのONNX推論サンプル

Related tags

Overview

ByteTrack-ONNX-Sample

Requirement

Demo

動画：動画に対しByteTrackで追跡した結果を動画出力します

Webカメラ：Webカメラ画像に対しByteTrackで追跡した結果をGUI表示します

Reference

Author

License

License(Movie)

Owner

KazuhitoTakahashi

Code and data for "Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning" (EMNLP 2021).

PyGCL: Graph Contrastive Learning Library for PyTorch

PyTorch implementation of Deformable Convolution

Annotate with anyone, anywhere.

Single Image Deraining Using Bilateral Recurrent Network (TIP 2020)

Codes for the compilation and visualization examples to the HIF vegetation dataset

A framework for the elicitation, specification, formalization and understanding of requirements.

FastFace: Lightweight Face Detection Framework

ViSER: Video-Specific Surface Embeddings for Articulated 3D Shape Reconstruction

Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization

Codebase for INVASE: Instance-wise Variable Selection - 2019 ICLR

novel deep learning research works with PaddlePaddle

PaddleBoBo是基于PaddlePaddle和PaddleSpeech、PaddleGAN等开发套件的虚拟主播快速生成项目

Code of the paper "Part Detector Discovery in Deep Convolutional Neural Networks" by Marcel Simon, Erik Rodner and Joachim Denzler

Official implementation of the ICCV 2021 paper: "The Power of Points for Modeling Humans in Clothing".

Moiré Attack (MA): A New Potential Risk of Screen Photos [NeurIPS 2021]

Pytorch implementation for ACMMM2021 paper "I2V-GAN: Unpaired Infrared-to-Visible Video Translation".

This is the code of NeurIPS'21 paper "Towards Enabling Meta-Learning from Target Models".

It's a powerful version of linebot

Geometry-Free View Synthesis: Transformers and no 3D Priors