A post-processing tool for scanned sheets of paper.

Last update: Dec 07, 2022

Related tags

Overview

unpaper

Originally written by Jens Gulden — see AUTHORS for more information. Licensed under GNU GPL v2 — see COPYING for more information.

Overview

unpaper is a post-processing tool for scanned sheets of paper, especially for book pages that have been scanned from previously created photocopies. The main purpose is to make scanned book pages better readable on screen after conversion to PDF. Additionally, unpaper might be useful to enhance the quality of scanned pages before performing optical character recognition (OCR).

unpaper tries to clean scanned images by removing dark edges that appeared through scanning or copying on areas outside the actual page content (e.g. dark areas between the left-hand-side and the right-hand-side of a double- sided book-page scan).

The program also tries to detect misaligned centering and rotation of pages and will automatically straighten each page by rotating it to the correct angle. This process is called "deskewing".

Note that the automatic processing will sometimes fail. It is always a good idea to manually control the results of unpaper and adjust the parameter settings according to the requirements of the input. Each processing step can also be disabled individually for each sheet.

See further documentation for the supported file formats notes.

Dependencies

The only hard dependency of unpaper is ffmpeg, which is used for file input and output.

Building instructions

unpaper uses GNU Autotools for its build system, so you should be able to execute the same commands used for other software packages:

./configure
make
sudo make install

There are, though, some recommendations about the way you build the code. Since the tasks are calculation-intensive, it is important to build with optimizations turned on:

./configure CFLAGS="-O2 -march-native -pipe"

Even better, if your compiler supports it, is to use Link-Time Optimizations, as that has shown that execution time can improve sensibly:

./configure CFLAGS="-O2 -march=native -pipe -flto"

Further optimizations such as -ftracer and -ftree-vectorize are thought to work, but their effect has not been evaluated so your mileage may vary.

Further Information

You can find more information on the basic concepts and the image processing in the available documentation.

A post-processing tool for scanned sheets of paper.

Related tags

Overview

unpaper

Overview

Dependencies

Building instructions

Further Information

Owner

EAST for ICPR MTWI 2018 Challenge II (Text detection of network images)

Motion Detection Squid Game with OpenCV Python

Optical character recognition for Japanese text, with the main focus being Japanese manga

OCR, Scene-Text-Understanding, Text Recognition

Layout Analysis Evaluator for the ICDAR 2017 competition on Layout Analysis for Challenging Medieval Manuscripts

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

基于图像识别的开源RPA工具，理论上可以支持所有windows软件和网页的自动化

Repository collecting all the submodules for the new PyTorch-based OCR System.

Run tesseract with the tesserocr bindings with @OCR-D's interfaces

[ICCV, 2021] Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

This tool will help you convert your text to handwriting xD

Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder

Using computer vision method to recognize and calcutate the features of the architecture.

A curated list of awesome synthetic data for text location and recognition

Machine Leaning applied to denoise images to improve OCR Accuracy

A curated list of resources dedicated to scene text localization and recognition

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

Educational application aimed at automating user-defined workflows for the mobile game, "Granblue Fantasy", using a variety of CV technologies in the backend such as OpenCV, PyAutoGUI and EasyOCR and a frontend coded in Typescript.

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

fishington.io bot with OpenCV and NumPy