Python scripts performing class agnostic object localization using the Object Localization Network model in ONNX.

Last update: Oct 14, 2022

Overview

ONNX Object Localization Network

Original image: https://en.wikipedia.org/wiki/File:Interior_design_865875.jpg

Important

I added a bit of logic to the box color selection to make the output look nicer. Because it runs K-Means for each box, it can be slow. If you only care about speed, either draw every box in the same color or use random colors.
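As a rough illustration of the two faster alternatives mentioned above, the sketch below returns one color per box without any clustering. The helper name `pick_box_colors`, its parameters, and the BGR tuple convention (as used by OpenCV drawing functions) are assumptions made for this example; they are not taken from the scripts in this repository.

```python
import random


def pick_box_colors(num_boxes, mode="random", fixed_color=(0, 255, 0), seed=None):
    """Return one BGR color tuple per box without running K-Means.

    Hypothetical helper: 'fixed' gives every box the same color,
    'random' draws an independent random color for each box.
    """
    if mode == "fixed":
        return [fixed_color] * num_boxes
    if mode != "random":
        raise ValueError(f"unknown mode: {mode!r}")
    rng = random.Random(seed)  # a seed makes the colors reproducible
    return [tuple(rng.randint(0, 255) for _ in range(3)) for _ in range(num_boxes)]


if __name__ == "__main__":
    print(pick_box_colors(3, mode="fixed"))
    print(pick_box_colors(3, mode="random", seed=0))
```

Each returned tuple could then be passed as the color argument when drawing the corresponding box, which avoids the per-box clustering cost entirely.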

Requirements

Check the requirements.txt file.
For ONNX, if you have an NVIDIA GPU, install onnxruntime-gpu; otherwise, install onnxruntime.
Additionally, pafy and youtube-dl are required for YouTube video inference.

Installation

git clone https://github.com/ibaiGorordo/ONNX-Object-Localization-Network.git
cd ONNX-Object-Localization-Network
pip install -r requirements.txt

ONNX Runtime

For NVIDIA GPU computers: pip install onnxruntime-gpu

Otherwise: pip install onnxruntime
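Whichever package is installed, the same script can pick the execution provider at run time. The following is a minimal sketch, assuming the standard ONNX Runtime provider names; the helper names `choose_providers` and `create_session` are hypothetical and are not part of this repository.

```python
def choose_providers(available):
    """Prefer CUDA when present, then fall back to CPU.

    `available` is the list returned by onnxruntime.get_available_providers().
    """
    preferred = ["CUDAExecutionProvider", "CPUExecutionProvider"]
    chosen = [p for p in preferred if p in available]
    # If neither preferred provider is listed, use whatever is available.
    return chosen or list(available)


def create_session(model_path):
    """Create an inference session for the ONNX file at `model_path`."""
    import onnxruntime as ort  # imported lazily so the helper above has no dependency

    providers = choose_providers(ort.get_available_providers())
    return ort.InferenceSession(model_path, providers=providers)
```

With onnxruntime-gpu installed and a working CUDA setup, `create_session` would request the CUDA provider first; with the CPU-only package it uses the CPU provider.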

For YouTube video inference

pip install youtube_dl
pip install git+https://github.com/zizo-pro/[email protected]

ONNX model

The original model was converted to ONNX by PINTO0309. Download the models using the download script in his repository and save them in the models folder.

The models are licensed under the Apache-2.0 License: https://github.com/mcahny/object_localization_network/blob/main/LICENSE

PyTorch model

The original PyTorch model can be found in this repository: https://github.com/mcahny/object_localization_network

Examples

Image inference:

python image_object_localization.py

Webcam inference:

python webcam_object_localization.py

Video inference: https://youtu.be/n9qhQJXYUWo

python video_object_localization.py

Original video: https://youtu.be/vgJUXvkdS78

References:

Object-Localization-Network model: https://github.com/mcahny/object_localization_network
PINTO0309's model zoo: https://github.com/PINTO0309/PINTO_model_zoo
Original paper: https://arxiv.org/abs/2108.06753

Owner

Ibai Gorordo