An interactive document scanner built in Python using OpenCV

Last update: Feb 12, 2022

Document Scanner

The scanner takes a poorly scanned image, finds the corners of the document, applies a perspective transformation to get a top-down view, sharpens the result, and applies an adaptive color threshold to clean it up.

On my test dataset of 280 images, the program correctly detected the corners of the document 92.8% of the time.

This project makes use of the transform and imutils modules from pyimagesearch. The UI code for the interactive mode is adapted from poly_editor.py.

In interactive mode, you can manually click and drag the corners of the document to set the region that will be perspective transformed.

The scanner can also process an entire directory of images automatically and save the output in an output directory.

Here are some examples of images before and after scan:

Usage

python scan.py (--images <IMG_DIR> | --image <IMG_PATH>) [-i]

The -i flag enables interactive mode, where you will be prompted to click and drag the corners of the document. For example, to scan a single image with interactive mode enabled:

python scan.py --image sample_images/desk.JPG -i

Alternatively, to scan all images in a directory without any input:

python scan.py --images sample_images
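For readers curious how a command line like this could be parsed, here is a minimal sketch using the standard library's argparse. It is an assumption about how such an interface might be wired up, not a copy of scan.py; the function name `build_parser` and the help strings are hypothetical.

```python
import argparse


def build_parser():
    """Build a parser that accepts exactly one of --images or --image, plus -i."""
    parser = argparse.ArgumentParser(prog="scan.py")

    # Exactly one input source must be given: a directory or a single file.
    group = parser.add_mutually_exclusive_group(required=True)
    group.add_argument("--images", help="directory of images to scan")
    group.add_argument("--image", help="path to a single image to scan")

    # Optional flag for interactive corner adjustment.
    parser.add_argument("-i", action="store_true", help="enable interactive mode")
    return parser


# Example: parse the single-image, interactive invocation shown above.
args = build_parser().parse_args(["--image", "sample_images/desk.JPG", "-i"])
```

With a mutually exclusive group marked as required, argparse rejects a command that passes both options or neither, which matches the `( ... | ... )` notation in the usage line.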

Owner

Kushal Shingote
