Tool which allow you to detect and translate text.

Last update: Nov 28, 2022

Overview

Text detection and recognition

This repository contains tool which allow to detect region with text and translate it one by one.

Description

Two pretrained neural networks are used. One of them is responsible for detecting places in which text appear and return its coordinates. Structure use for this operation is based on CRAFT architecture.

Craft Paper

Second network take detected words and recognize words included inside it. Convolutional Recurrential neural networks (CRNN) are used for this operation.

CRNN Paper

Example

Under construction

Deployment

I decided to deploy it on heroku (temporarily solution), but the amount of memory available on this platform is not enough. You can check it on heroku app. I decided to add bootstrap template because whole solution become more intuitive.

Windows Installation

To install it locally, you can run from your virtual env

python -m pip install requirements.txt

Linux installation

to install it properly on Linux OS you have to install additionaly


apt-get update
apt-get install -y libsm6 libxext6 libxrender-dev
pip install opencv-python

If problems with cv2 imports are still appearing then you should install

pip install opencv-contrib-python

Then you can run

```python
python -m pip install requirements.txt

Run

To run it locally, please activate your environment

> win
venv\Scripts\activate.bat

>linux
source venv\Scripts\activate

and run straight from project origin

python  app.py

If everything goes properly, you'll see on localhost:8000, screen just like one below.

Updates

I decided to remove argparse, because as I mention earlier, it was less intuitive. Solution is not fast, is more like an toy example which shows how to use Pytorch model on deployment environment.

Version which I use here contain torch-cpu which make preprocessing and detecting slightly slower. I test it on cuda and it was much faster.

If you have more information, drop me a line If you like it, give a star

Draft: Show how does it work on complex .tif example document.

Contact Info

Tool which allow you to detect and translate text.

Related tags

Overview

Text detection and recognition

Description

Example

Deployment

Windows Installation

Linux installation

Run

Updates

Owner

Damian Panek

Roboflow makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless.

Repositório para registro de estudo da biblioteca opencv (Python)

Image processing in Python

Apply different text recognition services to images of handwritten documents.

Creating of virtual elements of the graphical interface using opencv and mediapipe.

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集シーンテキストの位置認識と識別のための論文リソースの要約

Satoshi is a discord bot template in python using discord.py that allow you to track some live crypto prices with your own discord bot.

Automatically download multiple papers by keywords in CVPR

"Very simple but works well" Computer Vision based ID verification solution provided by LibraX.

Super Mario Game With Python

A simple Security Camera created using Opencv in Python where images gets saved in realtime in your Dropbox account at every 5 seconds

An Implementation of the seglink alogrithm in paper Detecting Oriented Text in Natural Images by Linking Segments

Awesome multilingual OCR toolkits based on PaddlePaddle （practical ultra lightweight OCR system, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices）

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

Detect textlines in document images

pyntcloud is a Python library for working with 3D point clouds.

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

Handwriting Recognition System based on a deep Convolutional Recurrent Neural Network architecture

Dataset and Code for ICCV 2021 paper "Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme"

Tool which allow you to detect and translate text.

Related tags

Overview

Text detection and recognition

Description

Example

Deployment

Windows Installation

Linux installation

Run

Updates

Owner

Damian Panek

Roboflow makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless.

Repositório para registro de estudo da biblioteca opencv (Python)

Image processing in Python

Apply different text recognition services to images of handwritten documents.

Creating of virtual elements of the graphical interface using opencv and mediapipe.

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集 シーンテキストの位置認識と識別のための論文リソースの要約

Satoshi is a discord bot template in python using discord.py that allow you to track some live crypto prices with your own discord bot.

Automatically download multiple papers by keywords in CVPR

"Very simple but works well" Computer Vision based ID verification solution provided by LibraX.

Super Mario Game With Python

A simple Security Camera created using Opencv in Python where images gets saved in realtime in your Dropbox account at every 5 seconds

An Implementation of the seglink alogrithm in paper Detecting Oriented Text in Natural Images by Linking Segments

Awesome multilingual OCR toolkits based on PaddlePaddle （practical ultra lightweight OCR system, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices）

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

Detect textlines in document images

pyntcloud is a Python library for working with 3D point clouds.

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection (TIP 2019)

Handwriting Recognition System based on a deep Convolutional Recurrent Neural Network architecture

Dataset and Code for ICCV 2021 paper "Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme"

A general list of resources to image text localization and recognition 场景文本位置感知与识别的论文资源与实现合集シーンテキストの位置認識と識別のための論文リソースの要約