pytextractor

python ocr using tesseract/ with EAST opencv text detector

Uses the EAST opencv detector defined here with pytesseract to extract text(default) or numbers from images.

Usage main

usage: text_detection.py [-h] [--east EAST] [-c CONFIDENCE] [-w WIDTH]
                         [-e HEIGHT] [-d] [-n] [-p PERCENTAGE] [-b MIN_BOXES]
                         [-i MAX_ITERATIONS]
                         images [images ...]

Text/Number extractor from image

positional arguments:
  images                path(s) to input image(s)

optional arguments:
  -h, --help            show this help message and exit
  --east EAST           path to input EAST text detector
  -c CONFIDENCE, --confidence CONFIDENCE
                        minimum probability required to inspect a region
  -w WIDTH, --width WIDTH
                        resized image width (should be multiple of 32)
  -e HEIGHT, --height HEIGHT
                        resized image height (should be multiple of 32)
  -d, --display         Display bounding boxes
  -n, --numbers         Detect only numbers
  -p PERCENTAGE, --percentage PERCENTAGE
                        Expand/shrink detected bound box
  -b MIN_BOXES, --min-boxes MIN_BOXES
                        minimum number of detected boxes to return
  -i MAX_ITERATIONS, --max-iterations MAX_ITERATIONS
                        max number of iterations finding min_boxes

Usage lib

from pytextractor import pytextractor

extractor = pytextractor.PyTextractor()

Running tests

python setup.py test

make sure tesseract is installed *

brew | apt-get install tesseract

python ocr using tesseract/ with EAST opencv detector

Related tags

Overview

pytextractor

Usage main

Usage lib

Running tests

Owner

Danny Crasto

SCOUTER: Slot Attention-based Classifier for Explainable Image Recognition

This repository summarized computer vision theories.

([email protected]) Boosting Co-teaching with Compression Regularization for Label Noise

RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition

~1000 book pages + OpenCV + python = page regions identified as paragraphs, lines, images, captions, etc.

Camera Intrinsic Calibration and Hand-Eye Calibration in Pybullet

An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

原神风花节自动弹琴辅助

chineseocr/table_line 表格线检测模型pytorch版

TableBank: A Benchmark Dataset for Table Detection and Recognition

Semantic-based Patch Detection for Binary Programs

Deep LearningImage Captcha 2

A buffered and threaded wrapper for the OpenCV VideoCapture object. Can speed up video decoding significantly. Supports

WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching

When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi-Task Learning Framework (CVPR 2021 oral)

TextBoxes: A Fast Text Detector with a Single Deep Neural Network https://github.com/MhLiao/TextBoxes 基于SSD改进的文本检测算法，textBoxes_note记录了之前整理的笔记。

RRD: Rotation-Sensitive Regression for Oriented Scene Text Detection

ARU-Net - Deep Learning Chinese Word Segment

PSENet - Shape Robust Text Detection with Progressive Scale Expansion Network.

This is a GUI program which consist of 4 OpenCV projects