TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

Last update: Dec 12, 2022

Overview

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection

Introduction

The code and trained models of:

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection, TIP 2019 [Paper]

Citation

Please cite the related works in your publications if it helps your research:


@article{xu2018textfield,
  title={TextField: Learning A Deep Direction Field for Irregular Scene Text Detection},
  author={Xu, Yongchao and Wang, Yukang and Zhou, Wei and Wang, Yongpan and Yang, Zhibo and Bai, Xiang},
  journal={arXiv preprint arXiv:1812.01393},
  year={2018}
}

Prerequisite

Caffe and SynthText pretrained model [Link]
Datasets: [Total-Text], [ICDAR2015]
OpenCV 3.4.3
MATLAB

Usage

1. Install Caffe

cp Makefile.config.example Makefile.config
# adjust Makefile.config (for example, enable python layer)
make all -j16
# make sure to include $CAFFE_ROOT/python to your PYTHONPATH.
make pycaffe

Please refer to Caffe Installation to ensure other dependencies.

2. Data and model preparation

# download datasets and pretrained model then
mkdir data && mv [your_dataset_folder] data/
mkdir models && mv [your_pretrained_model] models/

3. Training scripts

# an example on Total-Text dataset
cd examples/TextField/
python train.py --gpu [your_gpu_id] --dataset total --initmodel ../../models/synth_iter_800000.caffemodel

4. Evaluation scripts

# an example on Total-Text dataset
cd evaluation/total/
./eval.sh

Results and Trained Models

Total-Text

Recall	Precision	F-measure	Link
0.816	0.824	0.820	[Google drive]

*lambda=0.50 for post-processing

ICDAR2015

Recall	Precision	F-measure	Link
0.811	0.846	0.828	[Google drive]

*lambda=0.75 for post-processing

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

Related tags

Overview

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection

Introduction

Citation

Prerequisite

Usage

1. Install Caffe

2. Data and model preparation

3. Training scripts

4. Evaluation scripts

Results and Trained Models

Total-Text

ICDAR2015

Owner

Yukang Wang

A bot that plays TFT using OCR. Keeps track of bench, board, items, and plays the user defined team comp.

FastOCR is a desktop application for OCR API.

Convert PDF/Image to TXT using EasyOcr - the best OCR engine available!

Select range and every time the screen changes, OCR is activated.

Image processing is one of the most common term in computer vision

The CIS OCR PostCorrectionTool

Read Japanese manga inside browser with selectable text.

Rubik's Cube in pygame with OpenGL

This repository contains the code for the paper "SCANimate: Weakly Supervised Learning of Skinned Clothed Avatar Networks"

This is the implementation of the paper "Gated Recurrent Convolution Neural Network for OCR"

Localization of thoracic abnormalities model based on VinBigData (top 1%)

Open Source Differentiable Computer Vision Library for PyTorch

OCR system for Arabic language that converts images of typed text to machine-encoded text.

Some bits of javascript to transcribe scanned pages using PageXML

OCR-D-compliant page segmentation

Page to PAGE Layout Analysis Tool

Framework for the Complete Gaze Tracking Pipeline

MXNet OCR implementation. Including text recognition and detection.

QuanTaichi: A Compiler for Quantized Simulations (SIGGRAPH 2021)