Corner-based Region Proposal Network

Last update: Nov 04, 2022

CRPN is a two-stage detection framework for multi-oriented scene text. It uses corners to estimate the possible locations of text instances, followed by a region-wise subnetwork for further classification and regression. In our experiments, it achieves an F-measure of 0.876 on ICDAR 2013 and 0.845 on ICDAR 2015. The paper is available on arXiv.

Installation

This code is based on Caffe and py-faster-rcnn. It has been tested on Ubuntu 16.04 with CUDA 8.0.

Clone this repository. The commands below refer to the cloned directory as $CRPN_ROOT.
```
git clone https://github.com/xhzdeng/crpn.git
```

Build Caffe and pycaffe
```
cd $CRPN_ROOT/caffe-fast-rcnn
make -j8 && make pycaffe
```

Build the Cython modules
```
cd $CRPN_ROOT/lib
make
```
Prepare your own training data directory. For convenience, it should have this basic structure.
```
$VOCdevkit/
$VOCdevkit/VOC2007                    # image sets, annotations, etc. 
```
Then create a symlink to YOUR dataset
```
cd $CRPN_ROOT/data
ln -s [path] VOCdevkit
```
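Before training, it can help to confirm that the symlinked directory has the layout described above. The following is a minimal sketch, not part of this repository: the helper name `missing_voc_dirs` is hypothetical, and the sub-directory names (`JPEGImages`, `Annotations`, `ImageSets/Main`) are an assumption based on the standard PASCAL VOC 2007 devkit layout, so adjust them to your data. Calling `missing_voc_dirs("data/VOCdevkit")` from $CRPN_ROOT returns an empty list when every assumed directory is present.

```python
import os

# Assumed sub-directories of a PASCAL VOC 2007 style dataset (not confirmed
# by this README, which only says "image sets, annotations, etc.").
EXPECTED_SUBDIRS = (
    "JPEGImages",
    "Annotations",
    os.path.join("ImageSets", "Main"),
)


def missing_voc_dirs(devkit_root, year="2007"):
    """Return the assumed sub-directories that are absent under VOC<year>."""
    voc_root = os.path.join(devkit_root, "VOC" + year)
    return [
        sub for sub in EXPECTED_SUBDIRS
        if not os.path.isdir(os.path.join(voc_root, sub))
    ]
```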
Download the ImageNet-pretrained VGG-16 model. You can find it in the Caffe Model Zoo.

Train with YOUR dataset
```
cd $CRPN_ROOT
./experiments/scripts/train.sh [NET] [MODEL] [DATASET] [ITER_NUM]
# NET is the network architecture to use; only {vgg16} is supported in this implementation
# MODEL is the pre-trained model used to initialize the weights
# DATASET points to your dataset; please refer to the contents of train.sh
# ITER_NUM is the number of training iterations
```

Test with YOUR models
```
cd $CRPN_ROOT
./experiments/scripts/test.sh [NET] [MODEL] [DATASET]
# NET is the network architecture to use; only {vgg16} is supported in this implementation
# MODEL is the model to test
# DATASET points to your dataset; please refer to the contents of test.sh
```
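
If you launch training and testing from Python, the positional arguments above can be assembled and checked before the script runs. This is only a sketch under stated assumptions: `build_command` is a hypothetical helper, not part of this repository, and the model path and dataset name passed to it are placeholders. The argument order and the `vgg16` restriction are taken from the usage lines above.

```python
SUPPORTED_NETS = {"vgg16"}  # the only architecture listed in this README


def build_command(stage, net, model, dataset, iter_num=None):
    """Return the argv list for train.sh or test.sh (hypothetical helper)."""
    if stage not in ("train", "test"):
        raise ValueError("stage must be 'train' or 'test'")
    if net not in SUPPORTED_NETS:
        raise ValueError("unsupported net: %s" % net)
    argv = ["./experiments/scripts/%s.sh" % stage, net, model, dataset]
    if stage == "train":
        # train.sh takes ITER_NUM as a fourth argument; test.sh does not.
        if iter_num is None:
            raise ValueError("train.sh expects ITER_NUM")
        argv.append(str(iter_num))
    return argv
```

The returned list can be passed to `subprocess.check_call(argv, cwd=crpn_root)`, where `crpn_root` is the path to your clone.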

Test outputs are saved under:
```
output/<experiment directory>/<dataset name>/<network snapshot name>/
```

Demo

```
cd $CRPN_ROOT
./tools/demo.py --net [NET] --model [MODEL]
# NET is the network architecture to use; only {vgg16} is supported in this implementation
# MODEL is the path to the caffemodel you want to use
```

Models

You can download the pretrained model, trained for 100k iterations on SynthText, from OneDrive or BaiduYun. I have also uploaded a recently trained testing model. It achieves an F-measure of 0.8456 at 840p resolution on ICDAR 2015, which is similar to the performance reported in the paper but slightly faster.
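
For readers comparing their own results with the figure above: the F-measure is the harmonic mean of precision and recall. The sketch below shows only that final step. How detections are matched to ground truth is defined by each benchmark's evaluation protocol and is not reproduced here, and the function name `f_measure` is illustrative rather than taken from this repository.

```python
def f_measure(precision, recall):
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2.0 * precision * recall / (precision + recall)
```

Because it is a harmonic mean, the score is pulled toward the lower of the two values, so a detector with high precision but low recall still scores poorly.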

Citation

If you find the paper and code useful in your research, please consider citing:

@article{deng2018crpn,
    Title = {Detecting Multi-Oriented Text with Corner-based Region Proposals},
    Author = {Linjie Deng and Yanxiang Gong and Yi Lin and Jingwen Shuai and Xiaoguang Tu and Yufei Zhang and Zheng Ma and Mei Xie},
    Journal = {arXiv preprint arXiv:1804.02690},
    Year = {2018}
}

Owner

xhzdeng
