ISI's Optical Character Recognition (OCR) software for machine-print and handwriting data

Last update: Dec 08, 2021

Related tags

Overview

VistaOCR

ISI's Optical Character Recognition (OCR) software for machine-print and handwriting data

Publications

"How to Efficiently Increase Resolution in Neural OCR Models". Stephen Rawls, Huaigu Cao, Joe Mathai, Prem Natarajan. IEEE Workshop on Arabic Script Analysis and Recognition (ASAR) 2018.

"Combining Convolutional Neural Networks and LSTMs for Segmentation Free OCR". Stephen Rawls, Huaigu Cao, Senthil Kumar, Prem Natarajan. International Conference on Document Analysis and Recognition (ICDAR) 2017.

"Combining Deep Learning and Language Modeling for Segmentation-free OCR From Raw Pixels". Stephen Rawls, Huaigu Cao, Ekraam Sabir, Prem Natarajan. IEEE Workshop on Arabic Script Analysis and Recognition (ASAR) 2017.

Model

Pretrained Models

Coming Soon. Pre-trained models for English, French, and Arabic Handwriting

Performance Numbers

Coming soon. Expected character and word error rates from public datasets.

How to Train

Coming soon.

How to Decode using Existing Model

Coming soon.

Citation

@inproceedings{vistaocr,
  author    = {Stephen Rawls and Huaigu Cao and Senthil Kumar and Prem Natarjan},
  title     = {Combining Convolutional Neural Networks and LSTMs for Segmentation Free OCR},
  booktitle = {Proc. ICDAR},
  year      = {2017},
  url       = {https://doi.org/10.1109/ICDAR.2017.34},
  doi       = {10.1109/ICDAR.2017.34}
}

ISI's Optical Character Recognition (OCR) software for machine-print and handwriting data

Related tags

Overview

VistaOCR

Publications

Model

Pretrained Models

Performance Numbers

How to Train

How to Decode using Existing Model

Citation

Owner

ISI Center for Vision, Image, Speech, and Text Analytics

Some codes from PyImageSearch course's and external projects.

This is an API written in python that uses FastAPI. It is a simple API that can detect discord tokens in Images.

Isearch (OSINT) 🔎 Face recognition reverse image search on Instagram profile feed photos.

Generating .npy dataset and labels out of given image, containing numbers from 0 to 9, using opencv

Primary QPDF source code and documentation

Binarize document images

Multi-choice answer sheet correction system using computer vision with opencv & python.

A simple Digits Recogniser made in Python

Converts an image into funny, smaller amongus characters

POT : Python Optimal Transport

Text recognition (optical character recognition) with deep learning methods.

This repository contains the code for the paper "SCANimate: Weakly Supervised Learning of Skinned Clothed Avatar Networks"

Camelot: PDF Table Extraction for Humans

Text-to-Image generation

Crop regions in napari manually

Programa que viabiliza a OCR (Optical Character Reading - leitura óptica de caracteres) de um PDF.

docstrum

【Auto】原神⭐钓鱼辅助工具 | 自动收竿、校准游标 | ✨您只需要抛出鱼竿，我们会帮你完成一切✨

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Read Japanese manga inside browser with selectable text.