Character Segmentation using TensorFlow

Last update: Aug 25, 2022

Character Segmentation

Segment the characters and spaces in a single text line, following the paper "Chinese English mixed Character Segmentation as Semantic Segmentation".

dependencies

TensorFlow 1.3 or 1.4

Python 3

differences from the paper

The paper sets the label of spaces to 1 and everything else to 0. That labeling is awkward: the space between two characters spans many pixels, so it is hard for the network to decide which of them should be 1 and which 0, even though the approach can work. Here the labels are swapped: characters are set to 1 and spaces to 0.
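The label convention used here can be illustrated with a minimal sketch. It assumes one label per pixel column of the text line and a list of character column ranges; the function name `make_column_labels` and the `char_spans` argument are hypothetical and are not taken from this repository.

```python
import numpy as np

def make_column_labels(width, char_spans):
    """Return a 1-D label vector: 1 for character columns, 0 for spaces.

    char_spans is a hypothetical list of (start, end) column ranges,
    end exclusive, one per character in the text line.
    """
    labels = np.zeros(width, dtype=np.uint8)
    for start, end in char_spans:
        # Clip so that spans running past the image edge are truncated.
        start, end = max(0, start), min(width, end)
        labels[start:end] = 1
    return labels

# Two characters in a 12-pixel-wide line, separated by a 2-pixel space.
labels = make_column_labels(12, [(1, 4), (6, 9)])
print(labels)  # [0 1 1 1 0 0 1 1 1 0 0 0]
```

With this convention every column of a character carries the same label, so a wide gap between characters is simply a run of zeros.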

architecture of the network

Heuristic Rules for balanced_Binary_CrossEntropy
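The heuristic rules themselves are not reproduced in this text. For orientation only, here is a minimal NumPy sketch of one common form of class-balanced binary cross-entropy, in which each class is weighted by the frequency of the other class. The function name `balanced_bce` and the weighting scheme are assumptions and may differ from what train_char_seg.py actually uses.

```python
import numpy as np

def balanced_bce(y_true, y_pred, eps=1e-7):
    """Class-balanced binary cross-entropy (illustrative sketch only).

    beta is the fraction of negative (0) labels. Positive terms are
    weighted by beta and negative terms by (1 - beta), so the rarer
    class contributes more per pixel.
    """
    y_true = np.asarray(y_true, dtype=np.float64)
    # Clip predictions away from 0 and 1 to keep the logarithms finite.
    y_pred = np.clip(np.asarray(y_pred, dtype=np.float64), eps, 1.0 - eps)
    beta = 1.0 - y_true.mean()
    pos = -beta * y_true * np.log(y_pred)
    neg = -(1.0 - beta) * (1.0 - y_true) * np.log(1.0 - y_pred)
    return float((pos + neg).mean())

good = balanced_bce([1, 0, 0, 0], [0.9, 0.1, 0.1, 0.1])
bad = balanced_bce([1, 0, 0, 0], [0.1, 0.1, 0.1, 0.1])
print(good < bad)  # True
```

In this sketch, missing the single positive pixel is penalized more heavily than misclassifying one of the three negative pixels, which is the purpose of the balancing weights.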

make training images and labels

python3 make_train_images.py

train

python3 train_char_seg.py

test

python3 test_char_seg.py

other_things

You can either generate the training images first and then train on the saved images, or generate images on the fly while training. To switch between the two, change the following lines in data_generator.py:

enqueuer = GeneratorEnqueuer(generator_on_the_fly(**kwargs), use_multiprocessing=False)
#enqueuer = GeneratorEnqueuer(generator_from_folder(**kwargs), use_multiprocessing=False)
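Instead of commenting lines in and out, the choice could be driven by a flag. The sketch below is hypothetical: the two generators are placeholder stand-ins that only reuse the names from data_generator.py, and `pick_generator` is not part of the repository.

```python
def generator_on_the_fly(**kwargs):
    """Stand-in: the real one would synthesize a fresh batch each step."""
    while True:
        yield "synthesized batch"

def generator_from_folder(**kwargs):
    """Stand-in: the real one would read pre-made images from disk."""
    while True:
        yield "batch loaded from disk"

def pick_generator(on_the_fly=True, **kwargs):
    """Return the generator selected by the flag (hypothetical helper)."""
    source = generator_on_the_fly if on_the_fly else generator_from_folder
    return source(**kwargs)

gen = pick_generator(on_the_fly=False)
print(next(gen))  # batch loaded from disk
```

In data_generator.py, the value returned by such a helper would be passed to GeneratorEnqueuer in place of the hard-coded generator call.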
