YOLTv4 builds upon YOLT and SIMRDWN, and updates these frameworks to use the most performant version of YOLO, YOLOv4

Last update: Jan 06, 2023

Related tags

Overview

YOLTv4

YOLTv4 builds upon YOLT and SIMRDWN, and updates these frameworks to use the most performant version of YOLO, YOLOv4. YOLTv4 is designed to detect objects in aerial or satellite imagery in arbitrarily large images that far exceed the ~600×600 pixel size typically ingested by deep learning object detection frameworks.

This repository is built upon the impressive work of AlexeyAB's YOLOv4 implementation, which improves both speed and detection performance compared to YOLOv3 (which is implemented in SIMRDWN). We use YOLOv4 insead of "YOLOv5", since YOLOv4 is endorsed by the original creators of YOLO, whereas "YOLOv5" is not; furthermore YOLOv4 appears to have superior performance.

Below, we provide examples of how to use this repository with the open-source Rareplanes dataset.

Running YOLTv4

0. Installation

YOLTv4 is built to execute within a docker container on a GPU-enabled machine. The docker command creates an Ubuntu 16.04 image with CUDA 9.2, python 3.6, and conda.

Clone this repository (e.g. to /yoltv4/).
Download model weights to yoltv4/darknet/weights). See: https://github.com/AlexeyAB/darknet/releases/download/darknet_yolo_v3_optimal/yolov4.conv.137 https://github.com/AlexeyAB/darknet/releases/download/darknet_yolo_v3_optimal/yolov4.weights https://github.com/AlexeyAB/darknet/releases/download/darknet_yolo_v4_pre/yolov4-tiny.weights https://github.com/AlexeyAB/darknet/releases/download/darknet_yolo_v4_pre/yolov4-csp.conv.142
Install nvidia-docker.

Build docker file.

 nvidia-docker build -t yoltv4_image /yoltv4/docker

Spin up the docker container (see the docker docs for options).

 NV_GPU=0 nvidia-docker run -it -v /local_data:/local_data -v /yoltv4:/yoltv4 -ti --ipc=host --name yoltv4_gpu0 yoltv4_image

Compile the Darknet C program.

First Set GPU=1 CUDNN=1, CUDNN_HALF=1, OPENCV=1 in /yoltv4/darknet/Makefile, then make:
```
 cd /yoltv4/darknet
 make
```

1. Train

A. Prepare Data

Make YOLO images and labels (see yoltv4/notebooks/train_test_pipeline.ipynb for further details).
Create a txt file listing the training images.
Create file obj.names file with each desired object name on its own line.

Create file obj.data in the directory yoltv4/darknet/data containing necessary files. For example:

/yoltv4/darknet/data/rareplanes_train.data

 classes = 30
 train =  /local_data/cosmiq/wdata/rareplanes/train/txt/train.txt
 valid =  /local_data/cosmiq/wdata/rareplanes/train/txt/valid.txt
 names =  /yoltv4/darknet/data/rareplanes.name
 backup = backup/

Prepare config files.

See instructions here, or tweak /yoltv4/darknet/cfg/yoltv4_rareplanes.cfg.

B. Execute Training

Execute.

 cd /yoltv4/darknet
 time ./darknet detector train data/rareplanes_train.data  cfg/yoltv4_rareplanes.cfg weights/yolov4.conv.137  -dont_show -mjpeg_port 8090 -map

Review progress (plotted at: /yoltv4/darknet/chart_yoltv4_rareplanes.png).

2. Test

A. Prepare Data

Make sliced images (see yoltv4/notebooks/train_test_pipeline.ipynb for further details).
Create a txt file listing the training images.
Create file obj.data in the directory yoltv4/darknet/data containing necessary files. For example:

/yoltv4/darknet/data/rareplanes_test.data classes = 30 train = valid = /local_data/cosmiq/wdata/rareplanes/test/txt/test.txt names = /yoltv4/darknet/data/rareplanes.name backup = backup/

B. Execute Testing

Execute (proceeds at >80 frames per second on a Tesla P100):

 cd /yoltv4/darknet
 time ./darknet detector valid data/rareplanes_test.data cfg/yoltv4_rareplanes.cfg backup/ yoltv4_rareplanes_best.weights

Post-process detections:

A. Move detections into results directory

 mkdir /yoltv4/darknet/results/rareplanes_preds_v0
 mkdir  /yoltv4/darknet/results/rareplanes_preds_v0/orig_txt
 mv /yoltv4/darknet/results/comp4_det_test_*  /yoltv4/darknet/results/rareplanes_preds_v0/orig_txt/

B. Stitch detections back together and make plots

 time python /yoltv4/yoltv4/post_process.py \
     --pred_dir=/yoltv4/darknet/results/rareplanes_preds_v0/orig_txt/ \
     --raw_im_dir=/local_data/cosmiq/wdata/rareplanes/test/images/ \
     --sliced_im_dir=/local_data/cosmiq/wdata/rareplanes/test/yoltv4/images_slice/ \
     --out_dir= /yoltv4/darknet/results/rareplanes_preds_v0 \
     --detection_thresh=0.25 \
     --slice_size=416} \
     --n_plots=8

Outputs will look something like the figures below:

YOLTv4 builds upon YOLT and SIMRDWN, and updates these frameworks to use the most performant version of YOLO, YOLOv4

Related tags

Overview

YOLTv4

Running YOLTv4

0. Installation

1. Train

A. Prepare Data

B. Execute Training

2. Test

A. Prepare Data

B. Execute Testing

Owner

Adam Van Etten

This is a virtual picture dragging application. Users may virtually slide photos across the screen. The distance between the index and middle fingers determines the movement. Smaller distances indicate click and motion, whereas bigger distances indicate only hand movement.

This is a re-implementation of TransGAN: Two Pure Transformers Can Make One Strong GAN (CVPR 2021) in PyTorch.

Code for 2021 NeurIPS --- Towards Multi-Grained Explainability for Graph Neural Networks

Official PyTorch implementation of "Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition" in AAAI2022.

Pytorch implementation of the paper "COAD: Contrastive Pre-training with Adversarial Fine-tuning for Zero-shot Expert Linking."

This project is based on our SIGGRAPH 2021 paper, ROSEFusion: Random Optimization for Online DenSE Reconstruction under Fast Camera Motion .

Analyzes your GitHub Profile and presents you with a report on how likely you are to become the next MLH Fellow!

An Open Source Machine Learning Framework for Everyone

This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' published at ECIR'22.

YOLOv5 detection interface - PyQt5 implementation

HyperPose is a library for building high-performance custom pose estimation applications.

Code for Motion Representations for Articulated Animation paper

Dataset VSD4K includes 6 popular categories: game, sport, dance, vlog, interview and city.

Implementation for "Manga Filling Style Conversion with Screentone Variational Autoencoder" (SIGGRAPH ASIA 2020 issue)

[AAAI2021] The source code for our paper 《Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion》.

Implicit Deep Adaptive Design (iDAD)

Unofficial TensorFlow implementation of the Keyword Spotting Transformer model

Discord-Protect is a simple discord bot allowing you to have some security on your discord server by ordering a captcha to the user who joins your server.

Libraries, tools and tasks created and used at DeepMind Robotics.

A deep learning network built with TensorFlow and Keras to classify gender and estimate age.