Yolox-bytetrack-sample - Python sample of MOT (Multiple Object Tracking) using YOLOX and ByteTrack

Last update: Nov 09, 2022

Overview

yolox-bytetrack-sample

YOLOXとByteTrackを用いたMOT(Multiple Object Tracking)のPythonサンプルです。
YOLOXはYOLOX-ONNX-TFLite-Sampleで、ONNXに変換したモデルを使用しています。

08-01.yolox-bytetrack.mp4

Requirement

OpenCV 3.4.2 or later
onnxruntime 1.5.2 or later
cython_bbox 0.1.3 or later

※onnxruntime-gpuはonnxruntimeでも動作しますが、推論時間がかかるためGPUを推奨します
※Windowsでcython_bbox のインストールが失敗する場合は、GitHubからのインストールをお試しください(2022/02/13時点)

pip install -e git+https://github.com/samson-wang/cython_bbox.git#egg=cython-bbox

Demo

デモの実行方法は以下です。

python sample.py

--device
カメラデバイス番号の指定
デフォルト：0
--movie
動画ファイルの指定 ※指定時はカメラデバイスより優先
デフォルト：指定なし
--width
カメラキャプチャ時の横幅
デフォルト：960
--height
カメラキャプチャ時の縦幅
デフォルト：540

YOLOXパラメータ

--yolox_model
ロードするモデルの格納パス
デフォルト：model/yolox_nano.onnx
--input_shape
モデルの入力サイズ
デフォルト：416,416
--score_th
クラス判別の閾値
デフォルト：0.3
--nms_th
NMSの閾値
デフォルト：0.45
--nms_score_th
NMSのスコア閾値
デフォルト：0.1
--with_p6
Large P6モデルを使用するか否か
デフォルト：指定なし

ByteTrackパラメータ

--track_thresh
デフォルト：0.5
--track_buffer
デフォルト：30
--match_thresh
デフォルト：0.8
--min_box_area
デフォルト：10
--mot20
デフォルト：指定なし

※パラメータ詳細はByteTrackを参照ください。

Reference

Author

高橋かずひと(https://twitter.com/KzhtTkhs)

License

yolox-bytetrack-sample is under MIT License.

License(Movie)

サンプル動画はNHKクリエイティブ・ライブラリーのケニア共和国キツイ町並み(4) ふかんショットを使用しています。

Yolox-bytetrack-sample - Python sample of MOT (Multiple Object Tracking) using YOLOX and ByteTrack

Related tags

Overview

yolox-bytetrack-sample

Requirement

Demo

Reference

Author

License

License(Movie)

Owner

KazuhitoTakahashi

Code release for NeX: Real-time View Synthesis with Neural Basis Expansion

Object Detection using YOLO from PyImageSearch

Projects for AI/ML and IoT integration for games and other presented at re:Invent 2021.

StyleGAN2-ADA-training-jupyter - Training custom datasets in styleGAN2-ADA by NVIDIA using Jupyter

DSEE: Dually Sparsity-embedded Efficient Tuning of Pre-trained Language Models

Code for the IJCAI 2021 paper "Structure Guided Lane Detection"

ULMFiT for Genomic Sequence Data

DeepCO3: Deep Instance Co-segmentation by Co-peak Search and Co-saliency

3D ResNet Video Classification accelerated by TensorRT

Final Project for the CS238: Decision Making Under Uncertainty course at Stanford University in Autumn '21.

RMTD: Robust Moving Target Defence Against False Data Injection Attacks in Power Grids

A cross-lingual COVID-19 fake news dataset

Winners of DrivenData's Overhead Geopose Challenge

PyTorch Implementation of PIXOR: Real-time 3D Object Detection from Point Clouds

A multi-functional library for full-stack Deep Learning. Simplifies Model Building, API development, and Model Deployment.

Deep Convolutional Generative Adversarial Networks

Code for the paper "Improved Techniques for Training GANs"

Changing the Mind of Transformers for Topically-Controllable Language Generation

Vanilla and Prototypical Networks with Random Weights for image classification on Omniglot and mini-ImageNet. Made with Python3.

Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network