Code for Text Prior Guided Scene Text Image Super-Resolution

Last update: Dec 26, 2022

Related tags

Text Data & NLP TPGSR

Overview

Text Prior Guided Scene Text Image Super-Resolution

https://arxiv.org/abs/2106.15368

Jianqi Ma, Shi Guo, Lei Zhang
Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China

Recovering TextZoom samples

Environment:

Other possible python packages like pyyaml, cv2, Pillow and imgaug

Main idea

Single stage with loss

Multi-stage version

Configure your training

Download the pretrained recognizer from:

Aster: https://github.com/ayumiymk/aster.pytorch  
MORAN:  https://github.com/Canjie-Luo/MORAN_v2  
CRNN: https://github.com/meijieru/crnn.pytorch

Unzip the codes and walk into the '$TPGSR_ROOT$/', place the pretrained weights from recognizer in '$TPGSR_ROOT$/'.

Download the TextZoom dataset:

https://github.com/JasonBoy1/TextZoom

Train the corresponding model (e.g. TPGSR-TSRN):

chmod a+x train_TPGSR-TSRN.sh
./train_TPGSR-TSRN.sh
or
python3 main.py --arch="tsrn_tl_cascade" \       # The architecture
                --batch_size=48 \                # The batch size
                --STN \                          # Using STN net for alignment
		--mask \                         # Using the contour mask
		--use_distill \                  # Using the TP loss
		--gradient \                     # Using the Gradient Prior Loss
		--sr_share \                     # Sharing weights for SR Module
		--stu_iter=1 \                   # The number of interations in multi-stage version
		--vis_dir='vis_TPGSR-TSRN' \     # The checkpoint directory

Run the test-prefixed shell to test the corresponding model.

Adding '--go_test' in the shell file

Cite this paper:

@article{ma2021text,
title={Text Prior Guided Scene Text Image Super-resolution},
author={Ma, Jianqi and Guo, Shi and Zhang, Lei},
journal={arXiv preprint arXiv:2106.15368},
year={2021}
}

Code for Text Prior Guided Scene Text Image Super-Resolution

Related tags

Overview

Text Prior Guided Scene Text Image Super-Resolution

Recovering TextZoom samples

Environment:

Main idea

Single stage with loss

Multi-stage version

Configure your training

Download the pretrained recognizer from:

Download the TextZoom dataset:

Train the corresponding model (e.g. TPGSR-TSRN):

Run the test-prefixed shell to test the corresponding model.

Cite this paper:

Owner

IMS-Toucan is a toolkit to train state-of-the-art Speech Synthesis models

A machine learning model for analyzing text for user sentiment and determine whether its a positive, neutral, or negative review.

Analyse japanese ebooks using MeCab to determine the difficulty level for japanese learners

Implementation of TF-IDF algorithm to find documents similarity with cosine similarity

Biterm Topic Model (BTM): modeling topics in short texts

Textlesslib - Library for Textless Spoken Language Processing

Code for paper "Role-oriented Network Embedding Based on Adversarial Learning between Higher-order and Local Features"

Just Another Telegram Ai Chat Bot Written In Python With Pyrogram.

Transformer related optimization, including BERT, GPT

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

This is my reading list for my PhD in AI, NLP, Deep Learning and more.

A BERT-based reverse-dictionary of Korean proverbs

An implementation of WaveNet with fast generation

Auto-researching tool generating word documents.

Russian GPT3 models.

This repository is home to the Optimus data transformation plugins for various data processing needs.

Quantifiers and Negations in RE Documents

Codes to pre-train Japanese T5 models

Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17

Train and use generative text models in a few lines of code.