Form Segmentation

Let's explore how we can extract text from any form or scanned page.

Objectives

The goal is to find an algorithm that extracts as much information as possible from a given page (JPG format) so that it can be fed to another system (business logic, a neural network, a classifier, etc.). The overall process may not be perfect, but it should find enough information to identify the type of document and the identities involved.

Parse any form or scanned page and extract all text data (printed and handwritten), with no prior knowledge of the layout or structure of the document.
Fully automatic extraction (no human interaction, so it can scale out).
Reasonably fast, or at least able to speed up with more machines or CPUs.

Challenges

There are many challenges to overcome, but the main problem is to identify which parts of the form contain text.
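
One simple way to approach the "which parts contain text" question is a horizontal projection profile: count the ink pixels in each row and group consecutive non-empty rows into bands. This is only a minimal sketch of one possible technique, not necessarily the method used in this project. It assumes the page has already been binarized into rows of 0/1 values (1 = ink); the function name `text_row_bands` and the `min_ink` parameter are hypothetical.

```python
def text_row_bands(binary, min_ink=1):
    """Group consecutive rows containing ink into (start, end) bands.

    binary  : list of rows, each a list of 0/1 values (1 = ink pixel)
    min_ink : minimum number of ink pixels for a row to count as text
    Returns a list of inclusive (first_row, last_row) tuples.
    """
    bands = []
    start = None  # first row of the band currently being tracked
    for y, row in enumerate(binary):
        has_ink = sum(row) >= min_ink
        if has_ink and start is None:
            start = y                      # a new band begins
        elif not has_ink and start is not None:
            bands.append((start, y - 1))   # the band ended on the previous row
            start = None
    if start is not None:                  # band runs to the bottom of the page
        bands.append((start, len(binary) - 1))
    return bands


# Tiny 6x4 "page": two rows of ink, a gap, then one more row of ink.
page = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 1, 0],
    [0, 0, 0, 0],
]
print(text_row_bands(page))  # [(1, 2), (4, 4)]
```

Running the same idea on columns inside each band would give rough word or field boxes. Real forms also contain ruled lines and boxes that produce ink rows without text, so raising `min_ink` or removing lines first would likely be needed.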

Some other challenges:

Black border removal
ICR (Intelligent Character Recognition): recognize handwritten characters and convert them into text
Scanned pages: detect edges and apply a perspective transform to obtain a top-down view of the document
Noise removal (blur, Otsu thresholding, adaptive thresholding with OpenCV)
Shape detection and extraction
OCR (not a real issue, since Tesseract 4 works well for printed text)
Handwriting recognition
Minimizing errors
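
To make the noise-removal item more concrete, here is a minimal pure-Python sketch of Otsu's method, which picks the global threshold that maximizes the between-class variance of the grayscale histogram. In practice OpenCV would do this (for example `cv2.threshold` with the `cv2.THRESH_OTSU` flag); the helper names `otsu_threshold` and `binarize` below are hypothetical and only illustrate the idea on a flat list of 8-bit pixel values.

```python
def otsu_threshold(pixels):
    """Return the threshold t (0-255) that maximizes between-class variance.

    Pixels <= t form one class, pixels > t the other.
    pixels: iterable of integer gray levels in the range 0..255.
    """
    hist = [0] * 256
    total = 0
    for p in pixels:
        hist[p] += 1
        total += 1
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels in the "<= t" class
    sum0 = 0  # sum of gray levels in the "<= t" class
    for t in range(256):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue          # lower class still empty
        w1 = total - w0
        if w1 == 0:
            break             # upper class empty from here on
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        # Between-class variance, up to a constant factor of 1 / total**2.
        var = w0 * w1 * (m0 - m1) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def binarize(pixels, t):
    """Map pixels above t to 255 (white) and all others to 0 (black)."""
    return [255 if p > t else 0 for p in pixels]


# Dark ink around 10, bright paper around 200.
sample = [10, 12, 11, 200, 198, 205]
t = otsu_threshold(sample)
print(t, binarize(sample, t))
```

A single global threshold works best on evenly lit scans. For pages with shadows or uneven lighting, a locally computed threshold (the adaptive thresholding mentioned in the list) is usually the better fit.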

Owner

Philip Doxakis
