It is a image ocr tool using the Tesseract-OCR engine with the pytesseract package and has a GUI.

Last update: Jul 11, 2022

Related tags

Overview

OCR-Tool

It is a image ocr tool made in Python using the Tesseract-OCR engine with the pytesseract package and has a GUI. This is my second ever python project so feel free to make any suggestions

Release

To install it, extract the zip that you downloaded. Put it in a folder like program files. There is a file called OCR-Tool.exe. You could make a shortcut and put it on your desktop for easy access.
Windows might say it's dangerous to run and block it. Just click "more info" and there you can run it.
If you download the version without tesseract included please check the dependencies section for instructions on how to add it.

Version 1.1

With tesseract

https://drive.google.com/file/d/1EMS8cKsasorLRXpqVjLxo41nsAEk4SiF/view?usp=sharing

Without tesseract

https://drive.google.com/file/d/1O4EYF9EmawT0VRSM6U1XkDBndGQBxDRe/view?usp=sharing

Features

Modern GUI
Snipping tool (Credit to harupy's python snipping tool)
Open image from folder
Paste image from clipboard
Save text to .txt
Copy text to clipboard
Cancel snip

Dependencies

Tesseract OCR Engine (UB Mannheim). Install either version 4 or 5. 5 is recommended as it performs better. Look for the installlation folder, the default is program files. Copy it into the same folder as main.py and rename the folder to "tesseract"
Pytesseract
PyQt5

Known bugs

Copy text button crashes app
White text doesnt work well

Future features

Better preprocessing to help with weird backgrounds
Document ocr
Menu

make a better chinese character recognition OCR than tesseract

deep ocr See README_en.md for English installation documentation. 只在ubuntu下面测试通过，需要virtualenv安装，安装路径可自行调整： git clone https://github.com/JinpengLI/deep

1.5k Dec 28, 2022

Convert PDF/Image to TXT using EasyOcr - the best OCR engine available!

PDFImage2TXT - DOWNLOAD INSTALLER HERE What can you do with it? Convert scanned PDFs to TXT. Convert scanned Documents to TXT. No coding required!! In

2 Feb 22, 2022

Responsive Doc. scanner using U^2-Net, Textcleaner and Tesseract

Responsive Doc. scanner using U^2-Net, Textcleaner and Tesseract Toolset U^2-Net is used for background removal Textcleaner is used for image cleaning

3 Jul 13, 2022

Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)

Open Semantic Search https://opensemanticsearch.org Integrated search server, ETL framework for document processing (crawling, text extraction, text a

684 Jan 6, 2023

A Python wrapper for Google Tesseract

Python Tesseract Python-tesseract is an optical character recognition (OCR) tool for python. That is, it will recognize and "read" the text embedded i

4.6k Jan 6, 2023

Awesome multilingual OCR toolkits based on PaddlePaddle （practical ultra lightweight OCR system, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices）

English | 简体中文 Introduction PaddleOCR aims to create multilingual, awesome, leading, and practical OCR tools that help users train better models and a

27.5k Jan 8, 2023

OCR engine for all the languages

Description kraken is a turn-key OCR system optimized for historical and non-Latin script material. kraken's main features are: Fully trainable layout

431 Jan 4, 2023

Python tool that takes the OCR.space JSON output as input and draws a text overlay on top of the image.

OCR.space OCR Result Checker = Draw OCR overlay on top of image Python tool that takes the OCR.space JSON output as input, and draws an overlay on to

4 Oct 18, 2022

A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.

Attention-based OCR Visual attention-based OCR model for image recognition with additional tools for creating TFRecords datasets and exporting the tra

933 Dec 29, 2022

It is a image ocr tool using the Tesseract-OCR engine with the pytesseract package and has a GUI.

Related tags

Overview

OCR-Tool

Release

Version 1.1

With tesseract

Without tesseract

Features

Dependencies

Known bugs

Future features

You might also like...

make a better chinese character recognition OCR than tesseract

Convert PDF/Image to TXT using EasyOcr - the best OCR engine available!

Responsive Doc. scanner using U^2-Net, Textcleaner and Tesseract

A Python wrapper for Google Tesseract

Awesome multilingual OCR toolkits based on PaddlePaddle （practical ultra lightweight OCR system, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices）

OCR engine for all the languages

Python tool that takes the OCR.space JSON output as input and draws a text overlay on top of the image.

A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.

Releases(Release)

Release(Oct 15, 2021)

Changes

Owner

Khant Htet Aung

Smart computer vision application

Library used to deskew a scanned document

Indonesian ID Card OCR using tesseract OCR

A Joint Video and Image Encoder for End-to-End Retrieval

How to detect objects in real time by using Jupyter Notebook and Neural Networks , by using Yolo3

This is a pytorch re-implementation of EAST: An Efficient and Accurate Scene Text Detector.

Text Detection from images using OpenCV

第一届西安交通大学人工智能实践大赛（2018AI实践大赛--图片文字识别）第一名；仅采用densenet识别图中文字

This is the code for our paper DAAIN: Detection of Anomalous and AdversarialInput using Normalizing Flows

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

Aloception is a set of package for computer vision: aloscene, alodataset, alonet.

Handwritten Number Recognition using CNN and Character Segmentation

RRD: Rotation-Sensitive Regression for Oriented Scene Text Detection

Characterizing possible failure modes in physics-informed neural networks.

基于图像识别的开源RPA工具，理论上可以支持所有windows软件和网页的自动化

Introduction to image processing, most used and popular functions of OpenCV

Open Source Computer Vision Library

Python tool that takes the OCR.space JSON output as input and draws a text overlay on top of the image.

SRA's seminar on Introduction to Computer Vision Fundamentals

Um simples projeto para fazer o reconhecimento do captcha usado pelo jogo bombcrypto