This is a repository to learn and get more computer vision skills, make robotics projects integrating the computer vision as a perception tool and create a lot of awesome advanced controllers for the robots of the future.

Last update: Nov 03, 2022

Overview

THE COMPUTER VISION DOJO

This repository was created to learn and gain new knowledge about computer vision and all its possible applications in the field of robotics and smart systems.

SOFTWARE DEPENDENCIES 💻

PYTHON DEPENDENCIES

Python
Python is a programming language that lets you work quickly and integrate systems more effectively.
OpenCV
OpenCV (Open Source Computer Vision Library) is a library of programming functions mainly aimed at real-time computer vision.
Numpy
Numpy is a general-purpose array-processing package. It provides a high-performance multidimensional array object, and tools for working with these arrays. It is the fundamental package for scientific computing with Python.
Matplotlib
Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python.

C++ DEPENDENCIES

Microsoft C++ Build Tools
The Microsoft C++ Build Tools provides MSVC toolsets via a scriptable, standalone installer without Visual Studio. Recommended if you build C++ libraries and applications targeting Windows from the command-line (e.g. as part of your continuous integration workflow). Includes tools shipped in Visual Studio 2015 Update 3, Visual Studio 2017 version 15.9, and all major updates to Visual Studio 2019 (v16.x).
OpenCV
OpenCV (Open Source Computer Vision Library) is a library of programming functions mainly aimed at real-time computer vision.
cmake
CMake is an open-source, cross-platform family of tools designed to build, test and package software.

AUTHOR

Elkin Javier Guerra Galeano

Student of Mechatronics Engineering at EIA University, excited for integrating Software and Hardware systems.
He is curious about Control Theory and implementing Robotics Solutions with different math designs.
He has skills with problem-solving for real-life applications. He is passionate about building knowledge from a theory-practice approach.

This is a repository to learn and get more computer vision skills, make robotics projects integrating the computer vision as a perception tool and create a lot of awesome advanced controllers for the robots of the future.

Related tags

Overview

THE COMPUTER VISION DOJO

SOFTWARE DEPENDENCIES 💻

PYTHON DEPENDENCIES

C++ DEPENDENCIES

AUTHOR

Elkin Javier Guerra Galeano

Owner

Elkin Javier Guerra Galeano

RRD: Rotation-Sensitive Regression for Oriented Scene Text Detection

Sign Language Recognition service utilizing a deep learning model with Long Short-Term Memory to perform sign language recognition.

This repository lets you train neural networks models for performing end-to-end full-page handwriting recognition using the Apache MXNet deep learning frameworks on the IAM Dataset.

This repo contains several opencv projects done while learning opencv in python.

Python tool that takes the OCR.space JSON output as input and draws a text overlay on top of the image.

In this project we will be using the live feed coming from the webcam to create a virtual mouse with complete functionalities.

CNN+Attention+Seq2Seq

A list of hyperspectral image super-solution resources collected by Junjun Jiang

An OCR evaluation tool

make a better chinese character recognition OCR than tesseract

Shape Detection - It's a shape detection project with OpenCV and Python.

Image processing using OpenCv

Application that instantly translates sign-language to letters.

"Very simple but works well" Computer Vision based ID verification solution provided by LibraX.

End-to-end pipeline for real-time scene text detection and recognition.

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Text-to-Image generation

YOLOv5 in DOTA with CSL_label.(Oriented Object Detection)（Rotation Detection）（Rotated BBox）

Code for CVPR 2022 paper "Bailando: 3D dance generation via Actor-Critic GPT with Choreographic Memory"

[ICCV, 2021] Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks