Machine Learning to Denoise Images for Better OCR Accuracy

This project is an adaptation of this tutorial and used only for learning purposes: https://www.pyimagesearch.com/2021/10/20/using-machine-learning-to-denoise-images-for-better-ocr-accuracy/#download-the-code

Setting Up the project 🚀

First and foremost clone the project with:

$ git clone https://github.com/AntonioBriPerez/Ocr-Denoiser

You don't need to extract the zip files in order to train the model.

Once you have cloned the repository you will need to extract the features from the noisy images. This script will extract 5 x 5 - 25-d feature vectors and the it will extract the target (or cleaned) pixel value from the correspondiente ground truth standard image. And then, this features will be saved in a csv file (~200MB). To extract this features you will have to execute:

$ python3 build_features.py

It will generate the following output:

Once you have done that we will have to load those features in a proper split to train our Random Forest Regressor. That code is implemented in the file train_denoiser.py. To train the model you will have to run the command:

$ python train_denoiser.py

And it will generate:

To check that the model performs good you can execute:

$ python3 denoise_document.py --testing denoising-dirty-documents/test

And some images will be written in disk so you can check the original image and the image obtained by the model we just have trained.

Any doubts or suggestions please open an issue.

Machine Leaning applied to denoise images to improve OCR Accuracy

Related tags

Overview

Machine Learning to Denoise Images for Better OCR Accuracy

Setting Up the project 🚀

Owner

Antonio Bri Pérez

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

Text Detection from images using OpenCV

Generate text images for training deep learning ocr model

基于Paddle框架的PSENet复现

Resizing Canny Countour In Python

DouZero is a reinforcement learning framework for DouDizhu - 斗地主AI

It is a image ocr tool using the Tesseract-OCR engine with the pytesseract package and has a GUI.

BoxToolBox is a simple python application built around the openCV library

Thresholding-and-masking-using-OpenCV - Image Thresholding is used for image segmentation

Handwriting Recognition System based on a deep Convolutional Recurrent Neural Network architecture

Satoshi is a discord bot template in python using discord.py that allow you to track some live crypto prices with your own discord bot.

An Agnostic Computer Vision Framework - Pluggable to any Training Library: Fastai, Pytorch-Lightning with more to come

OCR engine for all the languages

A curated list of papers, code and resources pertaining to image composition

Distilling Knowledge via Knowledge Review, CVPR 2021

Apply different text recognition services to images of handwritten documents.

A tool to enhance your old/damaged pictures built using python & opencv.

A simple component to display annotated text in Streamlit apps.

天池2021"全球人工智能技术创新大赛"【赛道一】：医学影像报告异常检测 - 第三名解决方案