Contextual Attention Localization for Offline Handwritten Text Recognition

Last update: Feb 17, 2022

Overview

CALText

This repository contains the source code for the CALText model introduced in the paper "CALText: Contextual Attention Localization for Offline Handwritten Text". The details of this model are presented in: (Add paper link)

Samples of the datasets that were used to train and test the model can be found at: http://faculty.pucit.edu.pk/nazarkhan/work/urdu_ohtr/pucit_ohul_dataset.html

The code for this model is based on the following work:

https://github.com/JianshuZhang/WAP

https://github.com/wwjwhen/Watch-Attend-and-Parse-tensorflow-version

Requirements

Python 3

TensorFlow v1.6

Usage

Upload the data files to your Colab account and create pickle files (train, valid, and test images and labels) from the dataset. You can place the pickle files in any folder you prefer, but update the path settings in the code where the data is loaded.

Run "makepickle.ipynb" to create pickle files for the train and test data. Then split the train pickle file into train and valid pickle files by using the last 907 images and labels of train as the validation set.
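The train/valid split described above can be sketched as follows. This is a minimal illustration, not the repository's own code: it assumes the images and labels are stored as two parallel sequences, and the function name split_train_valid is hypothetical. Adapt it to whatever structure "makepickle.ipynb" actually writes.

```python
def split_train_valid(images, labels, n_valid=907):
    """Hold out the last n_valid samples of the train set for validation.

    Assumes (hypothetically) that images and labels are parallel sequences
    of equal length, such as lists loaded from the train pickle file.
    """
    if len(images) != len(labels):
        raise ValueError("images and labels must have the same length")
    if not 0 < n_valid < len(images):
        raise ValueError("n_valid must be between 1 and len(images) - 1")

    # Everything except the last n_valid samples stays in the train set.
    train = (images[:-n_valid], labels[:-n_valid])
    # The last n_valid samples become the validation set.
    valid = (images[-n_valid:], labels[-n_valid:])
    return train, valid
```

The two returned pairs can then be written back to disk with pickle.dump, using whatever file names the loading code in "CALText.ipynb" expects.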

For training, set mode="train", and run "CALText.ipynb".

For testing, set mode="test", and run "CALText.ipynb".

For Contextual Attention, set alpha_reg=0 during both training and testing.

For Contextual Attention Localization, set alpha_reg=1 during both training and testing.
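The combinations of mode and alpha_reg listed above can be checked before a run with a small helper. This is only a sketch: the function check_settings and the dictionary it returns are hypothetical and not part of the repository; only the names mode and alpha_reg and their values come from the instructions above.

```python
def check_settings(mode, alpha_reg):
    """Validate the two settings and report which model variant they select."""
    if mode not in ("train", "test"):
        raise ValueError('mode must be "train" or "test"')
    if alpha_reg not in (0, 1):
        raise ValueError("alpha_reg must be 0 or 1")

    # alpha_reg=0 selects Contextual Attention,
    # alpha_reg=1 selects Contextual Attention Localization.
    if alpha_reg == 1:
        variant = "Contextual Attention Localization"
    else:
        variant = "Contextual Attention"
    return {"mode": mode, "alpha_reg": alpha_reg, "variant": variant}
```

Using the same alpha_reg value for training and for testing keeps the evaluated model consistent with the one that was trained.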

Run on Python Compiler

To run the code with the Python interpreter, copy the code into files named "makepickle.py" and "CALText.py". Use the following commands to run them.

python makepickle.py

python CALText.py

Run on Google Colab

Open the "makepickle.ipynb" and "CALText.ipynb" notebooks in Google Colab and run them.

Run the "%tensorflow_version 1.x" command in the Colab notebook before running "CALText.ipynb".

Change the runtime to GPU or TPU for better performance.

Add these lines to the notebook to access data from Google Drive:

from google.colab import drive

drive.mount("/gdrive", force_remount=True)
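Once the drive is mounted, the path settings in the code need to point at the pickle files. The sketch below builds those paths in one place. The folder and file names (a "caltext" folder, "train.pkl", "valid.pkl", "test.pkl") are assumptions made for illustration, and the helper dataset_paths is hypothetical; substitute the names your own files use.

```python
import posixpath


def dataset_paths(data_dir, names=("train", "valid", "test"), ext=".pkl"):
    """Return a dict mapping each split name to its pickle file path.

    posixpath is used because Colab runs on Linux; the file names are
    hypothetical and should match what your pickle step actually produced.
    """
    return {name: posixpath.join(data_dir, name + ext) for name in names}


# Example: files kept in a hypothetical "caltext" folder on the mounted drive.
paths = dataset_paths("/gdrive/MyDrive/caltext")
```

Keeping the paths in a single dictionary means only data_dir has to change when the files are moved to a different folder.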

References

PUCIT Offline Handwritten Urdu Lines (PUCIT-OHUL) Dataset: http://faculty.pucit.edu.pk/nazarkhan/work/urdu_ohtr/pucit_ohul_dataset.html

Previous Work:

http://faculty.pucit.edu.pk/nazarkhan/work/urdu_ohtr/index.html

http://faculty.pucit.edu.pk/nazarkhan/work/urdu_ohtr/ICFHR2020_manuscript.pdf
