Libtorch yolov3 deepsort

Last update: Dec 13, 2022

Overview

It is for my undergrad thesis in Tsinghua University.

There are four modules in the project:

Detection: YOLOv3
Tracking: SORT and DeepSORT
Processing: Run detection and tracking, then display and save the results (a compressed video, a few snapshots for each target)
GUI: Display the results

YOLOv3

A Libtorch implementation of the YOLO v3 object detection algorithm, written with modern C++.

The code is based on the walktree.

The config file in .\models can be found at Darknet.

SORT

I also merged SORT to do tracking.

A similar software in Python is here, which also rewrite form the most starred version and SORT

DeepSORT

Recently I reimplement DeepSORT which employs another CNN for re-id. It seems it gives better result but also slows the program a bit. Also, a PyTorch version is available at ZQPei, thanks!

Performance

Currently on a GTX 1060 6G it consumes about 1G RAM and have 37 FPS.

The video I test is TownCentreXVID.avi.

GUI

With wxWidgets, I developed the GUI module for visualization of results.

Previously I used Dear ImGui. However, I do not think it suits my purpose.

Pre-trained network

This project uses pre-trained network weights from others

How to build

This project requires LibTorch, OpenCV, wxWidgets and CMake to build.

LibTorch can be easily integrated with CMake, but there are a lot of strange things...

On Ubuntu 16.04, I use apt install to install the others. Everything is fine. On Windows 10 + Visual Studio 2017, I use the latest stable version of the others from their official websites.

Snapshots

Here are some intermediate output from detection and tracking module:

Here is the snapshot of processing module:

Here is the snapshot of GUI module:

Libtorch yolov3 deepsort

Related tags

Overview

Overview

YOLOv3

SORT

DeepSORT

Performance

GUI

Pre-trained network

How to build

Snapshots

Owner

Xu Wei

GANimation: Anatomically-aware Facial Animation from a Single Image (ECCV'18 Oral) [PyTorch]

Code and experiments for "Deep Neural Networks for Rank Consistent Ordinal Regression based on Conditional Probabilities"

Code for "Reconstructing 3D Human Pose by Watching Humans in the Mirror", CVPR 2021 oral

PCGNN - Procedural Content Generation with NEAT and Novelty

EZ graph is an easy to use AI solution that allows you to make and train your neural networks without a single line of code.

Resco: A simple python package that report the effect of deep residual learning

Magisk module to enable hidden features on Android 12 Developer Preview 1.

UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation

Hysterese plugin with two temperature offset areas

GAN-based Matrix Factorization for Recommender Systems

Data and code for ICCV 2021 paper Distant Supervision for Scene Graph Generation.

Ros2-voiceroid2 - ROS2 wrapper package of VOICEROID2

GitHub repository for "Improving Video Generation for Multi-functional Applications"

GPT-Code-Clippy (GPT-CC) is an open source version of GitHub Copilot

Official implementation for "Image Quality Assessment using Contrastive Learning"

Fast, general, and tested differentiable structured prediction in PyTorch

Specification language for generating Generalized Linear Models (with or without mixed effects) from conceptual models

Official Repository for the paper "Improving Baselines in the Wild".

Segmentation vgg16 fcn - cityscapes

Relative Human dataset, CVPR 2022