A pure pytorch implemented ocr project including text detection and recognition

Last update: Dec 30, 2022

Overview

ocr.pytorch

A pure pytorch implemented ocr project.
Text detection is based CTPN and text recognition is based CRNN.
More detection and recognition methods will be supported!

Prerequisite

python-3.5+
pytorch-0.4.1+
torchvision-0.2.1
opencv-3.4.0.14
numpy-1.14.3

They could all be installed through pip except pytorch and torchvision. As for pytorch and torchvision, they both depends on your CUDA version, you would prefer to reading pytorch's official site

Detection

Detection is based on CTPN, some codes are borrowed from pytorch_ctpn, several detection results:

Recognition

Recognition is based on CRNN, some codes are borrowed from crnn.pytorch

Test

Download pretrained models from Baidu Netdisk (extract code: u2ff) or Google Driver and put these files into checkpoints. Then run

python3 demo.py

The image files in ./test_images will be tested for text detection and recognition, the results will be stored in ./test_result.

If you want to test a single image, run

python3 test_one.py [filename]

Train

Training codes are placed into train_code directory.
Train CTPN
Train CRNN

Licence

MIT License

A pure pytorch implemented ocr project including text detection and recognition

Related tags

Overview

ocr.pytorch

Prerequisite

Detection

Recognition

Test

Train

Licence

Owner

coura

Um simples projeto para fazer o reconhecimento do captcha usado pelo jogo bombcrypto

An Optical Character Recognition system using Pytesseract/Extracting data from Blood Pressure Reports.

This project proposes a camera vision based cursor control system, using hand moment captured from a webcam through a landmarks of hand by using Mideapipe module

Thresholding-and-masking-using-OpenCV - Image Thresholding is used for image segmentation

Using computer vision method to recognize and calcutate the features of the architecture.

利用Paddle框架复现CRAFT

Recognizing cropped text in natural images.

Balabobapy - Using artificial intelligence algorithms to continue the text

Histogram specification using openCV in python .

nofacedb/faceprocessor is a face recognition engine for NoFaceDB program complex.

OpenCV-Erlang/Elixir bindings

Omdena-abuja-anpd - Automatic Number Plate Detection for the security of lives and properties using Computer Vision.

Solution for Problem 1 by team codesquad for AIDL 2020. Uses ML Kit for OCR and OpenCV for image processing

Python-based tools for document analysis and OCR

Fully-automated scripts for collecting AI-related papers

Repository collecting all the submodules for the new PyTorch-based OCR System.

Code release for Hu et al., Learning to Segment Every Thing. in CVPR, 2018.

Machine Leaning applied to denoise images to improve OCR Accuracy

This is the open source implementation of the ICLR2022 paper "StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis"

Isearch (OSINT) 🔎 Face recognition reverse image search on Instagram profile feed photos.