Pytorch implementation of PSEnet with Pyramid Attention Network as feature extractor

Last update: Oct 10, 2022

Overview

Scene Text-Spotting based on PSEnet+CRNN

Pytorch implementation of an end to end Text-Spotter with a PSEnet text detector and CRNN text recognizer. We plan to grow this repository into an open research platform for multi-lingual text detection and recognition from natural scene images, targeted towards low-resource languages.

Requirements

Python 3.6.5
Pytorch 1.2
pyclipper
Polygon 3.0.8
OpenCV 3.4.1

Demo

Download the trained CRNN and PSEnet models from the links provided below.
Copy paths of the models and paste them in params.py
run end-end.py

python end-end.py --img [path to image] --e2e_config_name [end to end config name]

Pre-trained Models

Both PSEnet and CRNN pre-trained models can be found here: gdrive

the PSEnet model is a multi-lingual text detector, trained on MLT 2019. Works quite well!
the CRNN recognizes Hindi, Bangla, Malayalam, Kanada, Tamil, Telugu, Odia, Sanskrit, Marathi!

Download the models in models/ directory and modify params.py if required.

Training instructions

To train your own detection model refer to this file.
To train your own recognition model refer to this file.

Samples

Contributors

Azhar Shaikh, PES University LinkedIn
Nishant Sinha, OffNote Labs

Work done as part of Internship with OffNote Labs.

References

If this repository helps you, please star it. Thank you!

Pytorch implementation of PSEnet with Pyramid Attention Network as feature extractor

Related tags

Overview

Scene Text-Spotting based on PSEnet+CRNN

Requirements

Demo

Pre-trained Models

Training instructions

Samples

Contributors

References

Owner

azhar shaikh

Machine Leaning applied to denoise images to improve OCR Accuracy

OpenMMLab Text Detection, Recognition and Understanding Toolbox

A python scripts that uses 3 different feature extraction methods such as SIFT, SURF and ORB to find a book in a video clip and project trailer of a movie based on that book, on to it.

A simple QR-Code Reader in Python

An advanced 2D image manipulation with features such as edge detection and image segmentation built using OpenCV

Generic framework for historical document processing

Histogram specification using openCV in python .

Open Source Differentiable Computer Vision Library for PyTorch

The official code for the ICCV-2021 paper "Speech Drives Templates: Co-Speech Gesture Synthesis with Learned Templates".

OCR-D-compliant page segmentation

Aloception is a set of package for computer vision: aloscene, alodataset, alonet.

Script para controlar o movimento do mouse usando Python e openCV com câmera em tempo real que detecta pontos de referência da mão, rastreia padrões de gestos em vez de um mouse físico.

Characterizing possible failure modes in physics-informed neural networks.

Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition.

Python library to extract tabular data from images and scanned PDFs

Balabobapy - Using artificial intelligence algorithms to continue the text

Code for CVPR 2022 paper "SoftGroup for Instance Segmentation on 3D Point Clouds"

How to detect objects in real time by using Jupyter Notebook and Neural Networks , by using Yolo3

BNF Globalization Code (CVPR 2016)

Computer vision applications project (Flask and OpenCV)