Machine Learning to Denoise Images for Better OCR Accuracy

This project is an adaptation of the following tutorial and is intended for learning purposes only: https://www.pyimagesearch.com/2021/10/20/using-machine-learning-to-denoise-images-for-better-ocr-accuracy/#download-the-code

Setting Up the Project 🚀

First, clone the project:

$ git clone https://github.com/AntonioBriPerez/Ocr-Denoiser

You don't need to extract the zip files in order to train the model.

Once you have cloned the repository, you need to extract the features from the noisy images. The script takes 5 x 5 pixel windows from each noisy image, flattens each window into a 25-d feature vector, and reads the target (cleaned) pixel value from the corresponding ground-truth image. The features are then saved to a CSV file (~200 MB). To extract the features, run:

$ python3 build_features.py
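
As a rough illustration of the windowing idea described above, here is a minimal sketch. It is not the code in build_features.py: the function name `extract_patch_features`, the edge-replicating padding, and the [0, 1] pixel scaling are all assumptions made for this example.

```python
import numpy as np


def extract_patch_features(noisy, clean, window=5):
    """Return (features, targets) for every pixel of a grayscale image pair.

    noisy, clean: 2-D float arrays of the same shape (assumed scaled to [0, 1]).
    features: (H*W, window*window) flattened neighbourhoods taken from `noisy`.
    targets:  (H*W,) pixel values taken from `clean` at the window centres.
    """
    if noisy.shape != clean.shape:
        raise ValueError("noisy and clean images must have the same shape")

    pad = window // 2
    # Assumption: replicate the border so edge pixels also get a full window.
    padded = np.pad(noisy, pad, mode="edge")

    height, width = noisy.shape
    features = np.empty((height * width, window * window), dtype=np.float64)

    row = 0
    for y in range(height):
        for x in range(width):
            # The window whose centre is pixel (y, x) of the unpadded image.
            features[row] = padded[y:y + window, x:x + window].ravel()
            row += 1

    targets = clean.astype(np.float64).ravel()
    return features, targets


# Tiny synthetic example: a 3 x 4 "noisy" image and an all-white "clean" one.
noisy_demo = np.arange(12, dtype=np.float64).reshape(3, 4) / 11.0
clean_demo = np.ones((3, 4))
feats, targets = extract_patch_features(noisy_demo, clean_demo)
print(feats.shape, targets.shape)
```

Each row of `feats` pairs with the same index in `targets`, so the two arrays can be written side by side as one CSV row per pixel.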

Running build_features.py will generate the following output:

Once the features are extracted, they must be loaded and split into training and testing sets so we can train our Random Forest Regressor. That code is implemented in the file train_denoiser.py. To train the model, run:

$ python train_denoiser.py
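
The split-and-train step can be sketched with scikit-learn as follows. This is only an illustration under assumptions, not the contents of train_denoiser.py: the function name `train_denoiser`, the 75/25 split, the number of trees, and the synthetic stand-in data are all hypothetical. In the project the features would come from the CSV file instead.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split


def train_denoiser(features, targets, n_estimators=10, seed=42):
    """Fit a random forest on a train split and report RMSE on the held-out split."""
    # Assumption: hold out 25% of the rows for evaluation.
    train_x, test_x, train_y, test_y = train_test_split(
        features, targets, test_size=0.25, random_state=seed
    )
    model = RandomForestRegressor(n_estimators=n_estimators, random_state=seed)
    model.fit(train_x, train_y)

    predictions = model.predict(test_x)
    rmse = float(np.sqrt(mean_squared_error(test_y, predictions)))
    return model, rmse


# Synthetic stand-in data: 400 random 25-d vectors whose target is simply
# the centre value (index 12 of a flattened 5 x 5 window).
rng = np.random.default_rng(0)
demo_features = rng.random((400, 25))
demo_targets = demo_features[:, 12]

model, rmse = train_denoiser(demo_features, demo_targets)
print(f"hold-out RMSE: {rmse:.4f}")
```

Evaluating on rows the model never saw during fitting gives a fairer picture of how it will behave on new documents than the training error would.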

Training will generate:

To check that the model performs well, run:

$ python3 denoise_document.py --testing denoising-dirty-documents/test

Some images will be written to disk so you can compare each original image with the image produced by the model we have just trained.
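
For intuition, applying a trained regressor to a new image could look roughly like the sketch below. It is not the code in denoise_document.py: `denoise_image` and `CentrePixelModel` are hypothetical names, and the stand-in model only mimics the `predict` interface so that the example runs without a trained forest.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def denoise_image(model, noisy, window=5):
    """Predict a cleaned value for every pixel and return an 8-bit image.

    model: any object with a scikit-learn style `predict(features)` method.
    noisy: 2-D float array (assumed scaled to [0, 1]).
    """
    pad = window // 2
    # Assumption: replicate the border so every pixel has a full window.
    padded = np.pad(noisy, pad, mode="edge")

    # One window per pixel, flattened into a 25-d vector when window == 5.
    patches = sliding_window_view(padded, (window, window))
    features = patches.reshape(-1, window * window)

    predicted = np.asarray(model.predict(features), dtype=np.float64)
    cleaned = np.clip(predicted.reshape(noisy.shape), 0.0, 1.0)
    return (cleaned * 255).astype(np.uint8)


class CentrePixelModel:
    """Hypothetical stand-in for a trained regressor: returns the centre pixel."""

    def predict(self, features):
        return features[:, features.shape[1] // 2]


noisy_page = np.linspace(0.0, 1.0, 12).reshape(3, 4)
cleaned_page = denoise_image(CentrePixelModel(), noisy_page)
print(cleaned_page.shape, cleaned_page.dtype)
```

The windows must be built the same way at prediction time as they were when the training features were extracted, otherwise the model sees inputs it was never trained on.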

If you have any questions or suggestions, please open an issue.

Owner

Antonio Bri Pérez
