OCR of Chicago 1909 Renumbering Plan

Last update: Nov 21, 2021

Related tags

Computer Vision 1909

Overview

Requirements:

Python 3 (probably at least 3.4)
pipenv (pip3 install pipenv)
tesseract (brew install tesseract, at least if you have a mac and homebrew working)
imagemagick / ghostscript

Using this repository:

The working/ subfolders contain a folder for each page. Each contains a page.png file that's the baseline page. It'll attempt to auto-deskew and crop each page. If you want to manually override this process, create a page-handcrop.png file in the working directory. Some already have them.

pipenv install

make all at the top level should attempt to deskew, crop, split, and OCR everything, building CSV output in each working dir.

pipenv shell

make setup

make all

After that, concatenating all the page.csv files in each working dir should work.

csvstack working/*/page.csv > all_data.csv

Owner

ted whalen

GitHub Repository

Convert PDF/Image to TXT using EasyOcr - the best OCR engine available!

PDFImage2TXT - DOWNLOAD INSTALLER HERE What can you do with it? Convert scanned PDFs to TXT. Convert scanned Documents to TXT. No coding required!! In

2 Feb 22, 2022

Text modding tools for FF7R (Final Fantasy VII Remake)

FF7R_text_mod_tools Subtitle modding tools for FF7R (Final Fantasy VII Remake) There are 3 tools I made. make_dualsub_mod.exe: Merges (or swaps) subti

10 Dec 19, 2022

An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

InceptText-Tensorflow An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Orien

115 Dec 12, 2022

Camelot: PDF Table Extraction for Humans

Camelot: PDF Table Extraction for Humans Camelot is a Python library that makes it easy for anyone to extract tables from PDF files! Note: You can als

3.3k Dec 31, 2022

A buffered and threaded wrapper for the OpenCV VideoCapture object. Can speed up video decoding significantly. Supports

A buffered and threaded wrapper for the OpenCV VideoCapture object. Can speed up video decoding significantly. Supports "with"-syntax.

0 Oct 30, 2021

Optical character recognition for Japanese text, with the main focus being Japanese manga

Manga OCR Optical character recognition for Japanese text, with the main focus being Japanese manga. It uses a custom end-to-end model built with Tran

327 Jan 01, 2023

Some bits of javascript to transcribe scanned pages using PageXML

nashi (nasḫī) Some bits of javascript to transcribe scanned pages using PageXML. Both ltr and rtl languages are supported. Try it! But wait, there's m

15 Nov 09, 2022

a Deep Learning Framework for Text

DeLFT DeLFT (Deep Learning Framework for Text) is a Keras and TensorFlow framework for text processing, focusing on sequence labelling (e.g. named ent

350 Dec 19, 2022

Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)

An Image is Worth 16x16 Words, What is a Video Worth? paper Official PyTorch Implementation Gilad Sharir, Asaf Noy, Lihi Zelnik-Manor DAMO Academy, Al

213 Nov 12, 2022

CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)

CUTIE TensorFlow implementation of the paper "CUTIE: Learning to Understand Documents with Convolutional Universal Text Information Extractor." Xiaohu

147 Dec 20, 2022

SemTorch

SemTorch This repository contains different deep learning architectures definitions that can be applied to image segmentation. All the architectures a

154 Dec 07, 2022

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

EasyOCR Ready-to-use OCR with 80+ languages supported including Chinese, Japanese, Korean and Thai. What's new 1 February 2021 - Version 1.2.3 Add set

16.7k Jan 03, 2023

OCR of Chicago 1909 Renumbering Plan

Related tags

Overview

Owner

ted whalen

Convert PDF/Image to TXT using EasyOcr - the best OCR engine available!

Text modding tools for FF7R (Final Fantasy VII Remake)

An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

Camelot: PDF Table Extraction for Humans

A buffered and threaded wrapper for the OpenCV VideoCapture object. Can speed up video decoding significantly. Supports

Optical character recognition for Japanese text, with the main focus being Japanese manga

Some bits of javascript to transcribe scanned pages using PageXML

a Deep Learning Framework for Text

Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)

CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)

SemTorch

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python tool that takes the OCR.space JSON output as input and draws a text overlay on top of the image.

A simple OCR API server, seriously easy to be deployed by Docker, on Heroku as well

Histogram specification using openCV in python .

MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition

OCR powered screen-capture tool to capture information instead of images

POT : Python Optimal Transport

One Metrics Library to Rule Them All!

An organized collection of tutorials and projects created for aspriring computer vision students.