OCR system for Arabic language that converts images of typed text to machine-encoded text.

Last update: Jan 05, 2023

Overview

Arabic OCR

OCR system for Arabic language that converts images of typed text to machine-encoded text.
The system currently supports letters only (29 letters): ا-ى and لا.
It targets a simplified OCR problem: images that contain only Arabic characters (see the dataset link below for sample images).

Setup

Install Python, then run this command:

pip install -r requirements.txt

Run

Put the images in the src/test directory.
Go to the src directory and run the following command:
```
python OCR.py
```
An output folder will be created containing:
- a text folder, which holds one text file per input image.
- a running_time file, which records the time taken to process each image.

Pipeline

Dataset

Link to dataset of images and the corresponding text: here.
We used 1000 images to generate the character dataset used for training.

Examples

Line Segmentation
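
This README does not describe how lines are separated. As an illustration only, here is a minimal sketch of one common approach for typed text, a horizontal projection profile: rows that contain ink are grouped into consecutive runs, and each run is treated as one text line. The function name `segment_lines`, the `min_height` parameter, and the convention that text pixels are nonzero are assumptions made for this sketch; they are not taken from OCR.py.

```python
import numpy as np

def segment_lines(binary, min_height=1):
    """Return (top, bottom) row ranges of text lines, bottom exclusive.

    `binary` is a 2D array in which text pixels are nonzero (assumed convention).
    """
    # True for every row that contains at least one text pixel.
    has_ink = (np.asarray(binary) > 0).any(axis=1)
    # Pad with False so runs touching the top or bottom edge are still closed.
    padded = np.concatenate(([False], has_ink, [False]))
    # Indices where the profile switches between "blank" and "ink".
    changes = np.flatnonzero(padded[1:] != padded[:-1])
    starts, ends = changes[0::2], changes[1::2]
    return [(int(s), int(e)) for s, e in zip(starts, ends) if e - s >= min_height]

# Synthetic page with two "lines" of ink.
page = np.zeros((12, 20), dtype=np.uint8)
page[2:5, 3:15] = 1
page[8:10, 1:18] = 1
lines = segment_lines(page)
print(lines)  # [(2, 5), (8, 10)]
```

The same idea applied to columns of a single line image (a vertical projection) is often used as a starting point for word segmentation, although connected Arabic script usually needs extra handling at the character level.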

Word Segmentation

Character Segmentation

Performance

Average accuracy: 95%.
Average time per image: 16 seconds.

NOTE

We achieved these results using only the flattened image as the feature.
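
To make "flattened image as the feature" concrete, here is a minimal sketch of turning one segmented character image into a 1D feature vector. The 28x28 target size, the nearest-neighbour resize, the scaling to [0, 1], and the names `resize_nearest` and `flatten_feature` are all assumptions made for illustration; the README does not state the preprocessing actually used.

```python
import numpy as np

def resize_nearest(img, height, width):
    """Nearest-neighbour resize using plain NumPy indexing."""
    rows = np.arange(height) * img.shape[0] // height
    cols = np.arange(width) * img.shape[1] // width
    return img[rows[:, None], cols]

def flatten_feature(char_img, size=(28, 28)):
    """Resize a grayscale character image to a fixed size and flatten it."""
    img = np.asarray(char_img, dtype=np.float32)
    img = resize_nearest(img, *size)
    peak = img.max()
    if peak > 0:
        img = img / peak  # scale pixel values to [0, 1]
    return img.ravel()    # 2D image -> 1D feature vector

# Synthetic "character": a vertical stroke on a blank background.
char = np.zeros((40, 30), dtype=np.uint8)
char[10:30, 12:18] = 255
features = flatten_feature(char)
print(features.shape)  # (784,)
```

A fixed target size matters here because a classifier expects every feature vector to have the same length, regardless of the size of the segmented character.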

Owner

Hussein Youssef
