Object Detection and Multi-Object Tracking

Last update: Jan 04, 2023

Overview

Object Detection and Tracking

Object detection is a computer technology related to computer vision and image processing that deals with detecting instances of semantic objects of a certain class (such as humans, buildings, or cars) in digital images and videos.

Environment

I have tested on Ubuntu 16.04/18.04. The code may work on other systems.

[Ubuntu-Deep-Learning-Environment-Setup]

Ubuntu 16.04 / 18.04
ROS Kinetic / Melodic
GTX 1080Ti / RTX 2080Ti
python 2.7 / 3.6

Installation

Clone the repository

git clone https://github.com/yehengchen/Object-Detection-and-Tracking.git

[OneStage]

YOLO: Real-Time Object Detection and Tracking

How to train a YOLO model on custom images: YOLOv3 - [Link] / YOLOv4 - [Link]

YOLOv4 + Deep_SORT - Pedestrian Counting & Social Distance - [Link]
YOLOv3 + Deep_SORT - Pedestrian&Car Counting - [Link]

YOLOv3 + SORT - Pedestrian Counting - [Link]

Darknet_ROS: Real-Time Object Detection and Grasp Detection With ROS

YOLOv3 + ROS Kinetic - For small Custom Data - [Link]

YOLOv3 + ROS Melodic - Robot Grasp Detection - [Link]
Parts-Arrangement-Robot - [Link]

YOLOv3 + OpenCV + ROS Melodic - Object Detection (Rotated) - [Link]

SSD: Single Shot MultiBox Detector

How to train a SSD model on own images - [Link]

[TwoStage]

R-CNN: Region-based methods

Fast R-CNN / Faster R-CNN / Mask R-CNN

How to train a Mask R-CNN model on own images - [Link]

Mask R-CNN + ROS Kinetic - [Link]

This project is ROS package of Mask R-CNN algorithm for object detection and segmentation.

COCO & VOC Datasets

COCO dataset and Pascal VOC dataset - [Link]
How to get it working on the COCO dataset coco2voc - [Link]
Convert Dataset2Yolo - COCO / VOC - [Link]

Object Detection and Multi-Object Tracking

Related tags

Overview

Object Detection and Tracking

Environment

Ubuntu 16.04 / 18.04

ROS Kinetic / Melodic

GTX 1080Ti / RTX 2080Ti

python 2.7 / 3.6

Installation

[OneStage]

YOLO: Real-Time Object Detection and Tracking

YOLOv4 + Deep_SORT - Pedestrian Counting & Social Distance - [Link]

YOLOv3 + Deep_SORT - Pedestrian&Car Counting - [Link]

YOLOv3 + SORT - Pedestrian Counting - [Link]

Darknet_ROS: Real-Time Object Detection and Grasp Detection With ROS

YOLOv3 + ROS Kinetic - For small Custom Data - [Link]

YOLOv3 + ROS Melodic - Robot Grasp Detection - [Link]

Parts-Arrangement-Robot - [Link]

YOLOv3 + OpenCV + ROS Melodic - Object Detection (Rotated) - [Link]

SSD: Single Shot MultiBox Detector

How to train a SSD model on own images - [Link]

[TwoStage]

R-CNN: Region-based methods

Mask R-CNN + ROS Kinetic - [Link]

COCO & VOC Datasets

COCO dataset and Pascal VOC dataset - [Link]

How to get it working on the COCO dataset coco2voc - [Link]

Convert Dataset2Yolo - COCO / VOC - [Link]

CV & Robotics Paper List (3D object detection & 6D pose estimation) - [Link]

PapersWithCode: Browse > Computer Vision > Object Detection - [Link]

ObjectDetection Two-stage vs One-stage Detectors - [Link]

ObjectDetection mAP & IoU - [Link]

Owner

Bobby Chen

official implementation for the paper "Simplifying Graph Convolutional Networks"

This repository provides a PyTorch implementation and model weights for HCSC (Hierarchical Contrastive Selective Coding)

A minimal solution to hand motion capture from a single color camera at over 100fps. Easy to use, plug to run.

Code for LIGA-Stereo Detector, ICCV'21

Self-supervised learning on Graph Representation Learning (node-level task)

통일된 DataScience 폴더 구조 제공 및 가상환경 작업의 부담감 해소

Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning

🔥 Cogitare - A Modern, Fast, and Modular Deep Learning and Machine Learning framework for Python

PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System

discovering subdomains, hidden paths, extracting unique links

Leveraging OpenAI's Codex to solve cornerstone problems in Music

URIE: Universal Image Enhancementfor Visual Recognition in the Wild

DanceTrack: Multiple Object Tracking in Uniform Appearance and Diverse Motion

Code for paper "ASAP-Net: Attention and Structure Aware Point Cloud Sequence Segmentation"

Alphabetical Letter Recognition

GAN-generated image detection based on CNNs

NAS-FCOS: Fast Neural Architecture Search for Object Detection (CVPR 2020)

A Simple LSTM-Based Solution for "Heartbeat Signal Classification and Prediction" in Tianchi

Unsupervised Video Interpolation using Cycle Consistency

DANA paper supplementary materials