In this project we will be using the live feed coming from the webcam to create a virtual mouse with complete functionalities.

Last update: Dec 20, 2022

Overview

Virtual Mouse Using OpenCV

In this project we will be using the live feed coming from the webcam to create a virtual mouse using hand tracking.

Project Description:

In this project, I am using my hand as a virtual mouse than can do everything that a mouse does without even touching your system. I am using the webcam of my system to detect my hands. It will then create a bounding box around my hand and focus on two fingers: The fore finger and the middle finger. The fore finger will act as a cursor and moving it around, we will be moving the cursor around. Now, inorder to successfully click using hand tracking, it is detecting the distance between the fore finger and the middle finger. If they are joined together, then it will perform a click.

Furthermore, a smoothness factor was added as the movement was really shaky.

Requirements:

Following modules need to be installed for it to work properly:

OpenCV
Mediapipe
Autopy

OpenCV:

OpenCV is a huge open-source library for computer vision, machine learning, and image processing. OpenCV supports a wide variety of programming languages like Python, C++, Java, etc. It can process images and videos to identify objects, faces, or even the handwriting of a human.

It can be installed using "pip install opencv-python"

Mediapipe:

MediaPipe is a framework for building multimodal (eg. video, audio, any time series data), cross platform (i.e Android, iOS, web, edge devices) applied ML pipelines.

It can be installed using "pip install mediapipe"

Autopy:

AutoPy is a simple, cross-platform GUI automation library for Python. It includes functions for controlling the keyboard and mouse, finding colors and bitmaps on-screen, and displaying alerts.

It can be installed using "pip install autopy"

Important Note:

I faced alot of dependency issues throughout this project. Some of the issues and their solutions are as follows:

autopy not installing: This is because autopy currently doesn't support Python versions above 3.8
webcam not opening: It was a bug in mediapipe and was fixed in latest python versions

Hence, inorder for the project to run smoothly, you need to degrade the Python version to 3.8

How to Degrade Python Version:

Follow the following steps:

Uninstall Python from add/remove programs
Go to AppData and remove any python folder you see.
Download Python 3.8 from this link : Python 3.8
Install it.
Open command promt and run "pip" inorder to confirm installation.
Your Python version has been degraded :)

Contact Information:

For any further queries, feel free to contact me at:

Email: [email protected]

LinkedIn : Hassan Shahzad

In this project we will be using the live feed coming from the webcam to create a virtual mouse with complete functionalities.

Related tags

Overview

Virtual Mouse Using OpenCV

Project Description:

Requirements:

OpenCV:

Mediapipe:

Autopy:

Important Note:

How to Degrade Python Version:

Contact Information:

Owner

Hassan Shahzad

A bot that plays TFT using OCR. Keeps track of bench, board, items, and plays the user defined team comp.

📷 Face Recognition using Haar-Cascade Classifier, OpenCV, and Python

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

TextBoxes++: A Single-Shot Oriented Scene Text Detector

MeshToGeotiff - A fast Python algorithm to convert a 3D mesh into a GeoTIFF

A tensorflow implementation of EAST text detector

Color Picker and Color Detection tool for METR4202

Roboflow makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless.

A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.

RRD: Rotation-Sensitive Regression for Oriented Scene Text Detection

a Deep Learning Framework for Text

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

PyTorch Re-Implementation of EAST: An Efficient and Accurate Scene Text Detector

Image processing is one of the most common term in computer vision

A tool combining EasyOCR and LaMa to automatically detect text and replace it with an inpainted background.

Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?

Handwritten Text Recognition (HTR) system implemented with TensorFlow (TF) and trained on the IAM off-line HTR dataset. This Neural Network (NN) model recognizes the text contained in the images of segmented words.

Face Recognizer using Opencv Python

This repository contains the code for the paper "SCANimate: Weakly Supervised Learning of Skinned Clothed Avatar Networks"

[python3.6] 运用tf实现自然场景文字检测,keras/pytorch实现ctpn+crnn+ctc实现不定长场景文字OCR识别