Towards Boosting the Accuracy of Non-Latin Scene Text Recognition

Last update: Aug 07, 2022

Related tags

Deep Learning NonLatinPhotoOCR

Overview

Convolutional Recurrent Neural Network + CTCLoss | STAR-Net

Code for paper "Towards Boosting the Accuracy of Non-Latin Scene Text Recognition"

Dependence

Python3.6.5
torch==1.2.0
torchvision==0.4.0
tensorboard==2.3.0

How to run the code?

Prepare data

Follow the instructions in meijieru/crnn.pytorch to create lmdb datasets. Use the same step to create train and val data.

Change parameters and alphabets

Please update the parameters and alphabets according to the requirement.

Change parameters in the mytrain.py file
Change alphabets

Please put all the alphabets that appear in your labels in a file and input the list as charlist to mytrain.py, else the program will throw an error during training.

Train

Run mytrain.py -

python3 mytrain.py --trainRoot /ssd_scratch/cvit/sanjana/hindi-train-lmdb \
--valRoot /ssd_scratch/cvit/sanjana/hindi-test-lmdb \
--arch crnn --lan hindi --charlist /ssd_scratch/cvit/sanjana/crnn_new/lexicon.txt \
--batchSize 32 --nepoch 15 --cuda --expr_dir /ssd_scratch/cvit/sanjana \
--displayInterval 10 --valInterval 100 --adadelta \ 
--manualSeed 1234 --random_sample --deal_with_lossnan

Reference

meijieru/crnn.pytorch
Sierkinhane/crnn_chinese_characters_rec

If you use the dataset or code from this work, please add the following citation:-

@inproceedings{gunnaNonLatin2021,
  title={Towards {B}oosting the {A}ccuracy of {N}on-{L}atin {S}cene {T}ext {R}ecognition,
  author={Sanjana Gunna and Rohit Saluja and C V Jawahar},
  booktitle={2021 International Conference on Document Analysis and Recognition Workshops (ICDARW)},
  year={2021},
  organization={IEEE}
}

Towards Boosting the Accuracy of Non-Latin Scene Text Recognition

Related tags

Overview

Convolutional Recurrent Neural Network + CTCLoss | STAR-Net

Dependence

How to run the code?

Prepare data

Change parameters and alphabets

Train

Reference

Owner

Sanjana Gunna

Stochastic gradient descent with model building

HyDiff: Hybrid Differential Software Analysis

Working demo of the Multi-class and Anomaly classification model using the CLIP feature space

Hepsiburada - Hepsiburada Urun Bilgisi Cekme

Computational Pathology Toolbox developed by TIA Centre, University of Warwick.

Uni-Fold: Training your own deep protein-folding models.

NumPy로 구현한 딥러닝 라이브러리입니다. (자동 미분 지원)

Code for WSDM 2022 paper, Contrastive Learning for Representation Degeneration Problem in Sequential Recommendation.

An implementation of the BADGE batch active learning algorithm.

A Light in the Dark: Deep Learning Practices for Industrial Computer Vision

LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)

Human4D Dataset tools for processing and visualization

DaReCzech is a dataset for text relevance ranking in Czech

Multi-Modal Machine Learning toolkit based on PaddlePaddle.

A Library for Modelling Probabilistic Hierarchical Graphical Models in PyTorch

Python and Julia in harmony.

Code & Models for Temporal Segment Networks (TSN) in ECCV 2016

R interface to fast.ai

The official repository for "Revealing unforeseen diagnostic image features with deep learning by detecting cardiovascular diseases from apical four-chamber ultrasounds"

Here I will explain the flow to deploy your custom deep learning models on Ultra96V2.