Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network.

Last update: Dec 27, 2022

Overview

Dewarping Document Image

Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network.

Please browse 90_paper.pdf

Dewarping Process

We predict the displacement and the categories (foreground or background) at pixellevel by applying two tasks in FCN, and then remove the background of the input image, and mapped the foreground pixels to rectified image by interpolation according to the predicted displacements. The cracks maybe emerge in rectified image when using a forward mapping interpolation. Therefore, we construct Delaunay triangulations in all scattered pixels and then using interpolation.

Compare

Notice

2020.11.10 update the result file, including 6-25_11_52_54-49-rgb_ and 6-25_11_52_54-49_.
2022.2.17 update the Release Code.
2022.4.14 update Source file.

Release Code

The source code is open, please download from Source.

Please send an email to [email protected].

Running

1、Download model parameter and source codes

2、Resize the input image into 1024x960 (zooming in or out along the longest side and keeping the aspect ration, then filling zero for padding. )

3、Run python test.py --data_path_test=./dataset/shrink_1024_960/crop/

Training

Run python train.py

Dataset

The training dataset can be synthesised using the scripts.

Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network.

Related tags

Overview

Dewarping Document Image

Dewarping Process

Compare

Notice

Release Code

Running

Training

Dataset

Owner

Repository for benchmarking graph neural networks

Toontown: Galaxy, a new Toontown game based on Disney's Toontown Online

Code for ICE-BeeM paper - NeurIPS 2020

Language models are open knowledge graphs ( non official implementation )

A PyTorch implementation of Multi-digit Number Recognition from Street View Imagery using Deep Convolutional Neural Networks

git《Joint Entity and Relation Extraction with Set Prediction Networks》(2020) GitHub:

Unsupervised Representation Learning via Neural Activation Coding

Hso-groupie - A pwnable challenge in Real World CTF 4th

A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis

Usable Implementation of "Bootstrap Your Own Latent" self-supervised learning, from Deepmind, in Pytorch

DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Paddle implementation for "Cross-Lingual Word Embedding Refinement by ℓ1 Norm Optimisation" (NAACL 2021)

[CIKM 2019] Code and dataset for "Fi-GNN: Modeling Feature Interactions via Graph Neural Networks for CTR Prediction"

MlTr: Multi-label Classification with Transformer

Official PyTorch implementation of Segmenter: Transformer for Semantic Segmentation

Python wrapper class for OpenVINO Model Server. User can submit inference request to OVMS with just a few lines of code

Asynchronous Advantage Actor-Critic in PyTorch

[CVPR 2021] Exemplar-Based Open-Set Panoptic Segmentation Network (EOPSN)

Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision. ICCV 2021.

Code for ACM MM 2020 paper "NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination"