BoxToolBox is a simple python application built around the openCV library

Last update: Nov 12, 2021

Related tags

Overview

BoxToolBox

BoxToolBox is a simple python application built around the openCV library. It is not a full featured application to guide you through the whole process. It is a missing piece in your toolchain between Lightroom and Photoshop. You still need to take you box pictures as straight as possible with the same camera settings and pre-process them to match the lighting.

It will help you to

correct perspective of the source photos
quickly test the layout and arrangement
place photos to correct location
generate grid to recolor or use as mask to hide the seams

It will not

unify brightness and colors of source photos
magically correct photos taken from bad perspective
assemble final picture

How to install (Linux)

Just clone or download this repository and run the main script. You will also need opencv installed.

pip install opencv-python
git clone https://github.com/fhorinek/BoxToolBox.git
cd BoxToolBox
python BoxToolBox.py

How to install (Windows)

Just download and execute pre-built exe file

It will trigger Windows protection, you need to click on more info and run anyway

How to use it

Here is a quick start video on youtube

Perspective editor

One window acts as input for defining box corners and the second window shows transformation previu. Controls on the second window sets transformed image width and height. The resolution for the final picture will be Width * Grid W x Height * Grid H. Margin define how much of the image will be preserved around the defined box. Preview scale will define size of the temporary pictures used in layout editor. Smaller scale will make the editor go faster, larger scale will provide better quality.

Controls:

Mouse wheel - zoom
Left button - Pan
Middle button - Select point
N key and M key - Open previous and next image
Q - Close editor

Normally you only need to define a transformation box for the first photo. The transformation will be applied to all following pictures. If you bump the camera during the session you can find the first image that is affected and redefine the transformation box. All following images will use that correction.

Layout editor

You can use this window to compose the final image. Here you can change geometry for the final image. Set scaling and spacing for the images. You can use Transparent spacer to define a very precise scale.

If the settings window is not visible press Ctrl-P.

Controls:

Mouse wheel - zoom
Left button - Pan
Drag picture - Swap images
N key and M key - set previous and next image
E key - Open perspective editor for image
C key - Toggle Crop or Full flag for image
S key - Show full image with marker lines
Q key - Close editor
Render - Show final image in full resolution
Output - Render final image in layers

Use different slots to experiment with multiple layouts and geometries.

Output

Output for the image will consist of multiple images placed inside the directory slot_n. Photos in images will be placed to correct location on transparent background. You will also find the generated grid image. Import these images as layers to any photo editor to compose the final image.

Disclaimer

This tool is my hobby project, done in my free time for my personal use. However I think that other people might find it useful so I made extra steps to make it more friendly and easier to install. If you found a bug or want something to add, feel free to open an issue. Pull requests are also welcomed!

If you found it useful and want to thank me, you can buy me a bear :-)

BoxToolBox is a simple python application built around the openCV library

Related tags

Overview

BoxToolBox

How to install (Linux)

How to install (Windows)

How to use it

Perspective editor

Layout editor

Output

Disclaimer

Owner

František Horínek

FOTS Pytorch Implementation

SCOUTER: Slot Attention-based Classifier for Explainable Image Recognition

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

A post-processing tool for scanned sheets of paper.

Using computer vision method to recognize and calcutate the features of the architecture.

This is a passport scanning web service to help you scan, identify and validate your passport created with a simple and flexible design and ready to be integrated right into your system!

7th place solution

This pyhton script converts a pdf to Image then using tesseract as OCR engine converts Image to Text

Detect handwritten words in a text-line (classic image processing method).

An application of high resolution GANs to dewarp images of perturbed documents

OCR-D-compliant page segmentation

CNN+Attention+Seq2Seq

MORAN: A Multi-Object Rectified Attention Network for Scene Text Recognition

Image Smoothing and Blurring Using OpenCV

A Vietnamese personal card OCR website built with Django.

A version of nrsc5-gui that merges the interface developed by cmnybo with the architecture developed by zefie in order to start a new baseline that is not heavily dependent upon Python processing.

Introduction to Augmented Reality (AR) with Python 3 and OpenCV 4.2.

Code for the paper STN-OCR: A single Neural Network for Text Detection and Text Recognition

Packaged, Pytorch-based, easy to use, cross-platform version of the CRAFT text detector

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition: