MXNet OCR implementation. Including text recognition and detection.

Last update: Nov 01, 2022

Overview

insightocr

Text Recognition Accuracy on Chinese dataset by caffe-ocr

Network	LSTM	4x1 Pooling	Gray	Test Acc
SimpleNet	N	Y	Y	99.37%
SE-ResNet34	N	Y	Y	99.73%

Text Recognition Accuracy on VGG_Text, on the subset of label size<=18

Network	LSTM	4x1 Pooling	Gray	Test Acc
SimpleNet	Y	Y	Y	87.17%
SE-ResNet50-PReLU	Y	Y	Y	94.05%
SE-ResNeXt101-PReLU	Y	Y	Y	94.38%

Owner

Deep Insight

洞见

GitHub Repository

MXNet OCR implementation. Including text recognition and detection.

insightocr Text Recognition Accuracy on Chinese dataset by caffe-ocr Network LSTM 4x1 Pooling Gray Test Acc SimpleNet N Y Y 99.37% SE-ResNet34 N Y Y 9

99 Nov 01, 2022

An Implementation of the FOTS: Fast Oriented Text Spotting with a Unified Network

FOTS: Fast Oriented Text Spotting with a Unified Network Introduction This is a pytorch re-implementation of FOTS: Fast Oriented Text Spotting with a

171 Aug 04, 2022

Code for the "Sensing leg movement enhances wearable monitoring of energy expenditure" paper.

EnergyExpenditure Code for the "Sensing leg movement enhances wearable monitoring of energy expenditure" paper. Additional data for replicating this s

42 Oct 26, 2022

Smart computer vision application

Smart-computer-vision-application Backend : opencv and python Library required:

2 Jan 31, 2022

WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching

Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching Code based on our WACV 2022 Accepted Paper: https://arxiv.org/pdf/

13 Dec 17, 2022

A curated list of awesome synthetic data for text location and recognition

awesome-SynthText A curated list of awesome synthetic data for text location and recognition and OCR datasets. Text location SynthText SynthText_Chine

283 Jan 05, 2023

Virtual Zoom Gesture using OpenCV

Virtual_Zoom_Gesture I have created a virtual zoom gesture where we can Zoom in and Zoom out any image and even we can move that image anywhere on the

2 Dec 26, 2021

[BMVC'21] Official PyTorch Implementation of Grounded Situation Recognition with Transformers

Grounded Situation Recognition with Transformers Paper | Model Checkpoint This is the official PyTorch implementation of Grounded Situation Recognitio

18 Jul 19, 2022

text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network

text-detection-ctpn Scene text detection based on ctpn (connectionist text proposal network). It is implemented in tensorflow. The origin paper can be

3.3k Dec 30, 2022

Converts an image into funny, smaller amongus characters

SussyImage Converts an image into funny, smaller amongus characters Demo Mona Lisa | Lona Misa (Made up of AmongUs characters) API I've also added an

14 Aug 18, 2022

Polaris is a Face recognition attendance system .

Support Me 🚀 About Polaris 📄 Polaris is a system based on facial recognition with a futuristic GUI design, Can easily find people informations store

215 Dec 26, 2022

A pkg stiching around view images(4-6cameras) to generate bird's eye view.

AVP-BEV-OPEN Please check our new work AVP_SLAM_SIM A pkg stiching around view images(4-6cameras) to generate bird's eye view! View Demo · Report Bug

37 Dec 01, 2022

一键翻译各类图片内文字

一键翻译各类图片内文字针对群内、各个图站上大量不太可能会有人去翻译的图片设计，让我这种日语小白能够勉强看懂图片主要支持日语，不过也能识别汉语和小写英文支持简单的涂白和嵌字

574 Dec 28, 2022

This is a GUI program which consist of 4 OpenCV projects

Tkinter-OpenCV Project Using Tkinter, Opencv, Mediapipe This is a python GUI program using Tkinter which consist of 4 OpenCV projects 1. Finger Counte

3 Feb 22, 2022

Some bits of javascript to transcribe scanned pages using PageXML

nashi (nasḫī) Some bits of javascript to transcribe scanned pages using PageXML. Both ltr and rtl languages are supported. Try it! But wait, there's m

15 Nov 09, 2022

nofacedb/faceprocessor is a face recognition engine for NoFaceDB program complex.

faceprocessor nofacedb/faceprocessor is a face recognition engine for NoFaceDB program complex. Tech faceprocessor uses a number of open source projec

3 Sep 06, 2021

CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)

CUTIE TensorFlow implementation of the paper "CUTIE: Learning to Understand Documents with Convolutional Universal Text Information Extractor." Xiaohu

147 Dec 20, 2022

OpenCV-Erlang/Elixir bindings

evision [WIP] : OS : arch Build Status Ubuntu 20.04 arm64 Ubuntu 20.04 armv7 Ubuntu 20.04 s390x Ubuntu 20.04 ppc64le Ubuntu 20.04 x86_64 macOS 11 Big

194 Jan 05, 2023

The CIS OCR PostCorrectionTool

The CIS OCR Post Correction Tool PoCoTo Source code for the Java-based PoCoTo client enabling fast interactive batch corrections of complete OCR error

36 Dec 15, 2022

Open Source Differentiable Computer Vision Library for PyTorch

Kornia is a differentiable computer vision library for PyTorch. It consists of a set of routines and differentiable modules to solve generic computer

7.6k Jan 04, 2023

MXNet OCR implementation. Including text recognition and detection.

Related tags

Overview

insightocr

Text Recognition Accuracy on Chinese dataset by caffe-ocr

Text Recognition Accuracy on VGG_Text, on the subset of label size<=18

Owner

Deep Insight

MXNet OCR implementation. Including text recognition and detection.

An Implementation of the FOTS: Fast Oriented Text Spotting with a Unified Network

Code for the "Sensing leg movement enhances wearable monitoring of energy expenditure" paper.

Smart computer vision application

WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching

A curated list of awesome synthetic data for text location and recognition

Virtual Zoom Gesture using OpenCV

[BMVC'21] Official PyTorch Implementation of Grounded Situation Recognition with Transformers

text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network

Converts an image into funny, smaller amongus characters

Polaris is a Face recognition attendance system .

A pkg stiching around view images(4-6cameras) to generate bird's eye view.

一键翻译各类图片内文字

This is a GUI program which consist of 4 OpenCV projects

Some bits of javascript to transcribe scanned pages using PageXML

nofacedb/faceprocessor is a face recognition engine for NoFaceDB program complex.

CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)

OpenCV-Erlang/Elixir bindings

The CIS OCR PostCorrectionTool

Open Source Differentiable Computer Vision Library for PyTorch