Detect handwritten words in a text-line (classic image processing method).

Last update: Jan 03, 2023

Overview

Word segmentation

Implementation of scale space technique for word segmentation as proposed by R. Manmatha and N. Srimal. Even though the paper is from 1999, the method still achieves good results, is fast, and is easy to implement. The algorithm takes an image of a line as input and outputs the segmented words.

Run demo

Go to the src/ directory and run the script python main.py. The images from the data/ directory (taken from IAM dataset) are segmented into words and the results are saved to the out/ directory.

Documentation

An anisotropic filter kernel is applied to the input image to create blobs corresponding to words. After thresholding the blob-image, connected components are extracted which correspond to words.

Parameters

Most of the parameters of the function wordSegmentation deal with the shape of the filter kernel:

img: grayscale uint8 image of the text-line to be segmented.
kernelSize: size of filter kernel, must be an odd integer.
sigma: standard deviation of Gaussian function used for filter kernel.
theta: approximated width/height ratio of words, filter function is distorted by this factor.
minArea: ignore word candidates smaller than specified area.

The function prepareImg can be used to convert the input image to grayscale and to resize it to a fixed height:

img: input image.
height: image will be resized to fit specified height.

Algorithm

The illustration below shows how the algorithm works:

top left: input image.
top right: filter kernel is applied.
bottom left: blob image after thresholding.
bottom right: bounding boxes around words in original image.

Results

This algorithm gives good results on datasets with large inter-word-distances and small intra-word-distances like IAM. However, for historical datasets like Bentham or Ratsprotokolle results are not very good and more complex approaches should be used instead (e.g., a neural network based approach as implemented in the WordDetectorNN repository).

Detect handwritten words in a text-line (classic image processing method).

Related tags

Overview

Word segmentation

Run demo

Documentation

Parameters

Algorithm

Results

Owner

Harald Scheidl

A curated list of papers and resources for scene text detection and recognition

Document Layout Analysis

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集シーンテキストの位置認識と識別のための論文リソースの要約

Tesseract Open Source OCR Engine (main repository)

Pre-Recognize Library - library with algorithms for improving OCR quality.

Detecting Text in Natural Image with Connectionist Text Proposal Network (ECCV'16)

✌️Using this you can control your PC/Laptop volume by Hand Gestures created with Python.

BD-ALL-DIGIT - This Is Bangladeshi All Sim Cloner Tools

This is a repository to learn and get more computer vision skills, make robotics projects integrating the computer vision as a perception tool and create a lot of awesome advanced controllers for the robots of the future.

A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).

[EMNLP 2021] Improving and Simplifying Pattern Exploiting Training

Driver Drowsiness Detection with OpenCV & Dlib

Textboxes implementation with Tensorflow (python)

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

RRD: Rotation-Sensitive Regression for Oriented Scene Text Detection

Educational application aimed at automating user-defined workflows for the mobile game, "Granblue Fantasy", using a variety of CV technologies in the backend such as OpenCV, PyAutoGUI and EasyOCR and a frontend coded in Typescript.

ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones and fixes.

Text Detection from images using OpenCV

Bu uygulamada Python ve Opencv kullanarak bilgisayar kamerasından yüz tespiti yapıyoruz.

Scene text recognition

Detect handwritten words in a text-line (classic image processing method).

Related tags

Overview

Word segmentation

Run demo

Documentation

Parameters

Algorithm

Results

Owner

Harald Scheidl

A curated list of papers and resources for scene text detection and recognition

Document Layout Analysis

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集 シーンテキストの位置認識と識別のための論文リソースの要約

Tesseract Open Source OCR Engine (main repository)

Pre-Recognize Library - library with algorithms for improving OCR quality.

Detecting Text in Natural Image with Connectionist Text Proposal Network (ECCV'16)

✌️Using this you can control your PC/Laptop volume by Hand Gestures created with Python.

BD-ALL-DIGIT - This Is Bangladeshi All Sim Cloner Tools

This is a repository to learn and get more computer vision skills, make robotics projects integrating the computer vision as a perception tool and create a lot of awesome advanced controllers for the robots of the future.

A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).

[EMNLP 2021] Improving and Simplifying Pattern Exploiting Training

Driver Drowsiness Detection with OpenCV & Dlib

Textboxes implementation with Tensorflow (python)

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

RRD: Rotation-Sensitive Regression for Oriented Scene Text Detection

Educational application aimed at automating user-defined workflows for the mobile game, "Granblue Fantasy", using a variety of CV technologies in the backend such as OpenCV, PyAutoGUI and EasyOCR and a frontend coded in Typescript.

ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones and fixes.

Text Detection from images using OpenCV

Bu uygulamada Python ve Opencv kullanarak bilgisayar kamerasından yüz tespiti yapıyoruz.

Scene text recognition

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集シーンテキストの位置認識と識別のための論文リソースの要約