An interactive document scanner built in Python using OpenCV

Last update: Feb 12, 2022

Related tags

Overview

Document Scanner

An interactive document scanner built in Python using OpenCV

The scanner takes a poorly scanned image, finds the corners of the document, applies the perspective transformation to get a top-down view of the document, sharpens the image, and applies an adaptive color threshold to clean up the image.

On my test dataset of 280 images, the program correctly detected the corners of the document 92.8% of the time.

This project makes use of the transform and imutils modules from pyimagesearch (which can be accessed here). The UI code for the interactive mode is adapted from poly_editor.py from here.

You can manually click and drag the corners of the document to be perspective transformed:
The scanner can also process an entire directory of images automatically and save the output in an output directory:

Here are some examples of images before and after scan:

Usage

python scan.py (--images 
   
     | --image 
    
     ) [-i]

The -i flag enables interactive mode, where you will be prompted to click and drag the corners of the document. For example, to scan a single image with interactive mode enabled:

python scan.py --image sample_images/desk.JPG -i

Alternatively, to scan all images in a directory without any input:

python scan.py --images sample_images

An interactive document scanner built in Python using OpenCV

Related tags

Overview

Document Scanner

An interactive document scanner built in Python using OpenCV

Here are some examples of images before and after scan:

Usage

Owner

Kushal Shingote

PyNeuro is designed to connect NeuroSky's MindWave EEG device to Python and provide Callback functionality to provide data to your application in real time.

Forked from argman/EAST for the ICPR MTWI 2018 CHALLENGE

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Deep learning based page layout analysis

Textboxes implementation with Tensorflow (python)

Fast style transfer

Handwritten Text Recognition (HTR) using TensorFlow 2.x

BD-ALL-DIGIT - This Is Bangladeshi All Sim Cloner Tools

Application that instantly translates sign-language to letters.

Natural language detection

SCOUTER: Slot Attention-based Classifier for Explainable Image Recognition

OpenCVを用いたカメラキャリブレーションのサンプルです。2021/06/21時点でPython実装のある3種類(通常カメラ向け、魚眼レンズ向け(fisheyeモジュール)、全方位カメラ向け(omnidirモジュール))について用意しています。

一键翻译各类图片内文字

PAGE XML format collection for document image page content and more

Handwriting Recognition System based on a deep Convolutional Recurrent Neural Network architecture

Markup for note taking

Satoshi is a discord bot template in python using discord.py that allow you to track some live crypto prices with your own discord bot.

Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)

A Python wrapper for Google Tesseract