A pure pytorch implemented ocr project including text detection and recognition

Last update: Dec 30, 2022

Overview

ocr.pytorch

A pure pytorch implemented ocr project.
Text detection is based CTPN and text recognition is based CRNN.
More detection and recognition methods will be supported!

Prerequisite

python-3.5+
pytorch-0.4.1+
torchvision-0.2.1
opencv-3.4.0.14
numpy-1.14.3

They could all be installed through pip except pytorch and torchvision. As for pytorch and torchvision, they both depends on your CUDA version, you would prefer to reading pytorch's official site

Detection

Detection is based on CTPN, some codes are borrowed from pytorch_ctpn, several detection results:

Recognition

Recognition is based on CRNN, some codes are borrowed from crnn.pytorch

Test

Download pretrained models from Baidu Netdisk (extract code: u2ff) or Google Driver and put these files into checkpoints. Then run

python3 demo.py

The image files in ./test_images will be tested for text detection and recognition, the results will be stored in ./test_result.

If you want to test a single image, run

python3 test_one.py [filename]

Train

Training codes are placed into train_code directory.
Train CTPN
Train CRNN

Licence

MIT License

A pure pytorch implemented ocr project including text detection and recognition

Related tags

Overview

ocr.pytorch

Prerequisite

Detection

Recognition

Test

Train

Licence

Owner

coura

Automatically remove the mosaics in images and videos, or add mosaics to them.

A simple document layout analysis using Python-OpenCV

A novel region proposal network for more general object detection ( including scene text detection ).

Textboxes implementation with Tensorflow (python)

Text layer for bio-image annotation.

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

End-to-end pipeline for real-time scene text detection and recognition.

EQFace: An implementation of EQFace: A Simple Explicit Quality Network for Face Recognition

Balabobapy - Using artificial intelligence algorithms to continue the text

🔎 Like Chardet. 🚀 Package for encoding & language detection. Charset detection.

Play the Namibian game of Owela against a terrible AI. Built using Django and htmx.

nofacedb/faceprocessor is a face recognition engine for NoFaceDB program complex.

Text modding tools for FF7R (Final Fantasy VII Remake)

Generates a message from the infamous Jerma Impostor image

An Optical Character Recognition system using Pytesseract/Extracting data from Blood Pressure Reports.

Code for the paper STN-OCR: A single Neural Network for Text Detection and Text Recognition

This is the official PyTorch implementation of the paper "TransFG: A Transformer Architecture for Fine-grained Recognition" (Ju He, Jie-Neng Chen, Shuai Liu, Adam Kortylewski, Cheng Yang, Yutong Bai, Changhu Wang, Alan Yuille).

Implementation of our paper 'PixelLink: Detecting Scene Text via Instance Segmentation' in AAAI2018

Machine Leaning applied to denoise images to improve OCR Accuracy

Neural search engine for AI papers