Yolov5+SlowFast: Realtime Action Detection Based on PytorchVideo

Last update: Dec 30, 2022

Related tags

Deep Learning yolo_slowfast

Overview

Yolov5+SlowFast: Realtime Action Detection

A realtime action detection frame work based on PytorchVideo.

Here are some details about our modification:

we choose yolov5 as an object detector instead of detectron2, it is faster and more convenient
we use a tracker(deepsort) to allocate action labels to all objects(with same ids) in different frames
our processing speed reached 24.2 FPS at 30 inference barch size (on a single RTX 2080Ti GPU)

Relevant infomation: FAIR/PytorchVideo; Ultralytics/Yolov5

Demo comparison betwween original(<-left) and ours(->right).

Installation

create a new python environment:
```
conda create -n env_name python=3.7.11
```
install requiments:
```
pip install -r requirements.txt
```
download weights file(ckpt.t7) from [deepsort] to this folder:
```
./deep_sort/deep_sort/deep/checkpoint/
```
test on your video:
```
python yolo_slowfast.py --input {path to your video}
```
The first time to execute this command may take some times to download the yolov5 code and it's weights file from torch.hub, keep your network connected.

References

Thanks for these great works:

[1] Ultralytics/Yolov5

[2] ZQPei/deepsort

[3] FAIR/PytorchVideo

[2] AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions. paper

[3] SlowFast Networks for Video Recognition. paper

Citation

If you find our work useful, please cite as follow:

{   yolo_slowfast,
    author = {Wu Fan},
    title = { A realtime action detection frame work based on PytorchVideo},
    year = {2021},
    url = {\url{https://github.com/wufan-tb/gmm_dae}}
}

Yolov5+SlowFast: Realtime Action Detection Based on PytorchVideo

Related tags

Overview

Yolov5+SlowFast: Realtime Action Detection

A realtime action detection frame work based on PytorchVideo.

Here are some details about our modification:

Demo comparison betwween original(<-left) and ours(->right).

Installation

References

Citation

Owner

WuFan

Official Implementation for Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation

A computer vision pipeline to identify the "icons" in Christian paintings

Source code of the paper PatchGraph: In-hand tactile tracking with learned surface normals.

PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset

PyMatting: A Python Library for Alpha Matting

BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation

a simple, efficient, and intuitive text editor

Code release for Local Light Field Fusion at SIGGRAPH 2019

Class-Balanced Loss Based on Effective Number of Samples. CVPR 2019

Collection of common code that's shared among different research projects in FAIR computer vision team.

Python code for the paper How to scale hyperparameters for quickshift image segmentation

ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis

SuRE Evaluation: A Supplementary Material

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Mememoji - A facial expression classification system that recognizes 6 basic emotions: happy, sad, surprise, fear, anger and neutral.

An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.

TorchX: A PyTorch Extension Library for More Efficient Deep Learning

An implementation of Deep Forest 2021.2.1.

This is the official repository for our paper: ''Pruning Self-attentions into Convolutional Layers in Single Path''.

YOLOv7 - Framework Beyond Detection