Python Tensorflow 2 scripts for detecting objects of any class in an image without knowing their label.

Last update: Nov 15, 2022

Overview

Tensorflow-Mobile-Generic-Object-Localizer

Python Tensorflow 2 scripts for detecting objects of any class in an image without knowing their label.

Original image taken from the OpenCV AI Kit - Lite, make sure to check it out: https://www.kickstarter.com/projects/opencv/opencv-ai-kit-oak-depth-camera-4k-cv-edge-object-detection

❗ ⚠️ The object detector works better with images with few objects and it starts to fail in more complex scenes. The model is suitable for automatically labelling objects for custom object detection models.

Requirements

OpenCV, imread-from-url and tensorflow. Also, pafy and youtube-dl are required for youtube video inference.

Installation

pip install -r requirements.txt
pip install pafy youtube-dl

Tensorflow model

The original models was taken from Tensorflow Hub, download it, and place it in the models folder.

Use the following script to download the model:

python download_model.py

Examples

Image inference:

python imageObjectDetection.py

Webcam inference:

python webcamObjectDetection.py

Video inference:

python videoObjectDetection.py

Inference Examples

Original video by Animist: https://youtu.be/uKyoV0uG9rQ

Astronaut detection

Original image: https://commons.wikimedia.org/wiki/File:Astronaut_Standing_On_The_Moon.png

Excavator detection

Original image: https://en.wikipedia.org/wiki/Hitachi_Construction_Machinery_(Europe)#/media/File:ZX350LCN-3-Photo28-lo.jpg

Map island detection

Original image: https://ja.m.wikipedia.org/wiki/%E3%83%95%E3%82%A1%E3%82%A4%E3%83%AB:Map_of_Hawaii_highlighting_Hawaii_(island).svg

Phone accessories detection

Original image: https://upload.wikimedia.org/wikipedia/commons/thumb/1/1b/OnePlus_3_phone%2C_charger_and_package.jpg/1024px-OnePlus_3_phone%2C_charger_and_package.jpg

And many more

References:

Original model: https://tfhub.dev/google/object_detection/mobile_object_localizer_v1/1

Python Tensorflow 2 scripts for detecting objects of any class in an image without knowing their label.

Related tags

Overview

Tensorflow-Mobile-Generic-Object-Localizer

❗ ⚠️ The object detector works better with images with few objects and it starts to fail in more complex scenes. The model is suitable for automatically labelling objects for custom object detection models.

Requirements

Installation

Tensorflow model

Examples

Inference Examples

Astronaut detection

Excavator detection

Map island detection

Phone accessories detection

And many more

References:

Owner

Ibai Gorordo

Analysis of Smiles through reservoir sampling & RDkit

PyTorch and GPyTorch implementation of the paper "Conditioning Sparse Variational Gaussian Processes for Online Decision-making."

Experimenting with computer vision techniques to generate annotated image datasets from gameplay recordings automatically.

Repo for EchoVPR: Echo State Networks for Visual Place Recognition

[CVPR2021] Invertible Image Signal Processing

Complex Answer Generation For Conversational Search Systems.

2D&3D human pose estimation

DL course co-developed by YSDA, HSE and Skoltech

a dnn ai project to classify which food people are eating on audio recordings

Implementation of H-UCRL Algorithm

Knowledge Management for Humans using Machine Learning & Tags

App for identification of various objects. Based on YOLO v4 tiny architecture

Official implementation for "Low-light Image Enhancement via Breaking Down the Darkness"

PyTorch implementation of "Representing Shape Collections with Alignment-Aware Linear Models" paper.

[CVPR 2022 Oral] EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

Code release for NeuS

A free, multiplatform SDK for real-time facial motion capture using blendshapes, and rigid head pose in 3D space from any RGB camera, photo, or video.

Source code for CVPR 2020 paper "Learning to Forget for Meta-Learning"

The software associated with a paper accepted at EMNLP 2021 titled "Open Knowledge Graphs Canonicalization using Variational Autoencoders".

CLIP + VQGAN / PixelDraw