Textboxes : Image Text Detection Model : python package (tensorflow)

Last update: Dec 15, 2022

Overview

shinTB

Abstract

A python package for use Textboxes : Image Text Detection Model

implemented by tensorflow, cv2

Textboxes Paper Review in Korean (My Blog) : shinjayne.github.io/textboxes

shintb : useable textboxes python package (Source codes are in here)

svt1 : Street view Text dataset. can use with shintb.svt_data_loader.SVTDataLoader when training Textboxes model

config.py : (NECESSARY) configuration of model building and training with shinTB

main.py : simple example useage of shinTB package

Dependancies

python Version: 3.5.3
numpy Version: 1.13.0
tensorflow Version: 1.2.1
cv2

How to use

Clone this repository to your local.
You will use shintb python package and config.py for building and training your own Textboxes model.
svt1 gives us training / test data.
Open new python file.
Import config.config and shintb.

from config import config
from shintb import graph_drawer, default_box_control, svt_data_loader, runner

Initialize GraphDrawer,DefaultBoxControl,SVTDataLoader instance.

graphdrawer = graph_drawer.GraphDrawer(config)

dataloader = svt_data_loader.SVTDataLoader('./svt1/train.xml', './svt1/test.xml')

dbcontrol = default_box_control.DefaultBoxControl(config, graphdrawer)

GraphDrawer instance contains a tensorflow graph of Textboxes.
DefaultboxControl instance contains methods and attributes which is related to default box.
SVTDataLoader instance loads data from svt1.
Initialize Runner instance.

runner = runner.Runner(config, graphdrawer, dataloader, dbcontrol)

Runner uses GraphDrawer,DefaultBoxControl,SVTDataLoader instance.
If you want to train your Textboxes model, use Runner.train(). Every 1000 step, shintb will save ckpt file in the directory you set in config.py.

runner.train()

If you want to validate/test your model, use Runner.test()

runner.test()

After training, if you want to detect texts from one image use Runner.image().

runner.image(<your_image_directory>)

Textboxes : Image Text Detection Model : python package (tensorflow)

Related tags

Overview

shinTB

Abstract

Dependancies

How to use

Owner

Jayne Shin (신재인)

caffe re-implementation of R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

A curated list of papers and resources for scene text detection and recognition

Read Japanese manga inside browser with selectable text.

An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

A bot that plays TFT using OCR. Keeps track of bench, board, items, and plays the user defined team comp.

Optical character recognition for Japanese text, with the main focus being Japanese manga

This project modify tensorflow object detection api code to predict oriented bounding boxes. It can be used for scene text detection.

Scan the MRZ code of a passport and extract the firstname, lastname, passport number, nationality, date of birth, expiration date and personal numer.

Regions sanitàries (RS), Sectors Sanitàris (SS) i Àrees Bàsiques de Salut (ABS) de Catalunya

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

A list of hyperspectral image super-solution resources collected by Junjun Jiang

Simple app for visual editing of Page XML files

In this project we will be using the live feed coming from the webcam to create a virtual mouse with complete functionalities.

A selectional auto-encoder approach for document image binarization

This is a pytorch re-implementation of EAST: An Efficient and Accurate Scene Text Detector.

Polaris is a Face recognition attendance system .

QuanTaichi: A Compiler for Quantized Simulations (SIGGRAPH 2021)

Table Extraction Tool

computer vision, image processing and machine learning on the web browser or node.

Text page dewarping using a "cubic sheet" model