Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Last update: Dec 06, 2022

Related tags

Overview

This is the official implementation of "Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation".

For more details, please refer to our paper.

Citing the paper

Please cite the paper in your publications if it helps your research:

@inproceedings{lyu2018multi,
      title={Multi-oriented scene text detection via corner localization and region segmentation},
      author={Lyu, Pengyuan and Yao, Cong and Wu, Wenhao and Yan, Shuicheng and Bai, Xiang},
      booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
      pages={7553--7563},
      year={2018}
}

Requirements
Installation
Models
Test
Train
License

Requirements

NVIDIA GPU, Ubuntu 14.04, Python2.7, CUDA8/9
PyTorch 0.2.0_3

Installation

git clone https://github.com/lvpengyuan/corner.git
sh ./make.sh   or  cd rpsroi_pooling && python build.py

Models

Download the model and place it in weights/

Our trained model: Google Drive;

Test

You can test a model in a single scale:

python eval_all.py

or in multi-scale:

python eval_multiscale.py

Note that, you should modify the model path and the test dataset before testing.

Train

python train.py

To train a new model, you should modify the training settings before training.

License

This code is only for academic purpose.

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Related tags

Overview

Citing the paper

Contents

Requirements

Installation

Models

Test

Train

License

Owner

Pengyuan Lyu

Convert Text-to Handwriting Using Python

第一届西安交通大学人工智能实践大赛（2018AI实践大赛--图片文字识别）第一名；仅采用densenet识别图中文字

Camelot: PDF Table Extraction for Humans

This is a implementation of CRAFT OCR method

Detect the mathematical formula from the given picture and the same formula is extracted and converted into the latex code

Tesseract Open Source OCR Engine (main repository)

This is a tensorflow re-implementation of PSENet: Shape Robust Text Detection with Progressive Scale Expansion Network.My blog:

A document scanner application for laptops/desktops developed using python, Tkinter and OpenCV.

Text language identification using Wikipedia data

Kornia is a open source differentiable computer vision library for PyTorch.

This repository contains codes on how to handle mouse event using OpenCV

A synthetic data generator for text recognition

Qrcode Attendence System with Opencv and Pyzbar

TextBoxes++: A Single-Shot Oriented Scene Text Detector

7th place solution

Page to PAGE Layout Analysis Tool

Face_mosaic - Mosaic blur processing is applied to multiple faces appearing in the video

Awesome multilingual OCR toolkits based on PaddlePaddle （practical ultra lightweight OCR system, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices）

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

Use Youdao OCR API to covert your clipboard image to text.