Textboxes : Image Text Detection Model : python package (tensorflow)

Last update: Dec 15, 2022

Overview

shinTB

Abstract

A python package for use Textboxes : Image Text Detection Model

implemented by tensorflow, cv2

Textboxes Paper Review in Korean (My Blog) : shinjayne.github.io/textboxes

shintb : useable textboxes python package (Source codes are in here)

svt1 : Street view Text dataset. can use with shintb.svt_data_loader.SVTDataLoader when training Textboxes model

config.py : (NECESSARY) configuration of model building and training with shinTB

main.py : simple example useage of shinTB package

Dependancies

python Version: 3.5.3
numpy Version: 1.13.0
tensorflow Version: 1.2.1
cv2

How to use

Clone this repository to your local.
You will use shintb python package and config.py for building and training your own Textboxes model.
svt1 gives us training / test data.
Open new python file.
Import config.config and shintb.

from config import config
from shintb import graph_drawer, default_box_control, svt_data_loader, runner

Initialize GraphDrawer,DefaultBoxControl,SVTDataLoader instance.

graphdrawer = graph_drawer.GraphDrawer(config)

dataloader = svt_data_loader.SVTDataLoader('./svt1/train.xml', './svt1/test.xml')

dbcontrol = default_box_control.DefaultBoxControl(config, graphdrawer)

GraphDrawer instance contains a tensorflow graph of Textboxes.
DefaultboxControl instance contains methods and attributes which is related to default box.
SVTDataLoader instance loads data from svt1.
Initialize Runner instance.

runner = runner.Runner(config, graphdrawer, dataloader, dbcontrol)

Runner uses GraphDrawer,DefaultBoxControl,SVTDataLoader instance.
If you want to train your Textboxes model, use Runner.train(). Every 1000 step, shintb will save ckpt file in the directory you set in config.py.

runner.train()

If you want to validate/test your model, use Runner.test()

runner.test()

After training, if you want to detect texts from one image use Runner.image().

runner.image(<your_image_directory>)

Textboxes : Image Text Detection Model : python package (tensorflow)

Related tags

Overview

shinTB

Abstract

Dependancies

How to use

Owner

Jayne Shin (신재인)

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

Create single line SVG illustrations from your pictures

text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network

InverseRenderNet: Learning single image inverse rendering, CVPR 2019.

Some bits of javascript to transcribe scanned pages using PageXML

SemTorch

Scan the MRZ code of a passport and extract the firstname, lastname, passport number, nationality, date of birth, expiration date and personal numer.

Scene text detection and recognition based on Extremal Region(ER)

SCOUTER: Slot Attention-based Classifier for Explainable Image Recognition

caffe re-implementation of R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

🔎 Like Chardet. 🚀 Package for encoding & language detection. Charset detection.

A collection of resources (including the papers and datasets) of OCR (Optical Character Recognition).

Generating .npy dataset and labels out of given image, containing numbers from 0 to 9, using opencv

A Python wrapper for the tesseract-ocr API

scene-linear test images

Generates a message from the infamous Jerma Impostor image

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:

CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)

Programa que viabiliza a OCR (Optical Character Reading - leitura óptica de caracteres) de um PDF.

This is a real life mario project using python and mediapipe