Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images"

Last update: Dec 30, 2022

Related tags

Computer Vision TableNet

Overview

TableNet

Unofficial implementation of ICDAR 2019 paper : TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images.

Paper

Overview

Paper: TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images

TableNet is a modern deep learning architecture that was proposed by a team from TCS Research year in the year 2019. The main motivation was to extract information from scanned tables through mobile phones or cameras.

They proposed a solution that includes accurate detection of the tabular region within an image and subsequently detecting and extracting information from the rows and columns of the detected table.

Architecture: The architecture is based out of Long et al., an encoder-decoder model for semantic segmentation. The same encoder/decoder network is used as the FCN architecture for table extraction. The images are preprocessed and modified using the Tesseract OCR.

Source: Nanonets

How to run

pip install -r requirements.txt

Download the Marmot Dataset from the link given in readme.
Run data_preprocess/generate_mask.py to generate Table and Column Mask of corresponding images.
Follow the TableNet.ipynb notebook to train and test the model.

Challenges

Require a very decent System with a good GPU for accurate result on High pixel images.

Dataset

Download the dataset provided in paper : Marmot Dataset.

Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images"

Related tags

Overview

TableNet

Overview

How to run

Challenges

Dataset

Owner

Jainam Shah

The official code for the ICCV-2021 paper "Speech Drives Templates: Co-Speech Gesture Synthesis with Learned Templates".

Python-based tools for document analysis and OCR

Table recognition inside douments using neural networks

Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)

Virtualdragdrop - Virtual Drag and Drop Using OpenCV and Arduino

Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?

3点クリックで円を指定し、極座標変換を行うサンプルプログラム

Use Youdao OCR API to covert your clipboard image to text.

End-to-end pipeline for real-time scene text detection and recognition.

Assignment work with webcam

Amazing 3D explosion animation using Pygame module.

A curated list of resources dedicated to scene text localization and recognition

SemTorch

Dataset and Code for ICCV 2021 paper "Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme"

Handwritten_Text_Recognition

Just a script for detecting the lanes in any car game (not just gta 5) with specific resolution and road design ( very basic and limited )

A Python wrapper for the tesseract-ocr API

原神风花节自动弹琴辅助

Memory tests solver with using OpenCV

This repository contains codes on how to handle mouse event using OpenCV