A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution (CVPR2022)

Last update: Jan 05, 2023

Related tags

Overview

A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution (CVPR2022)

Jianqi Ma, Zhetong Liang, Lei Zhang
Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China & OPPO Research

Recovering TextZoom samples

Environment:

Other possible python packages like pyyaml, cv2, Pillow and imgaug

Main idea

The pipeline

TP Interpreter

Configure your training

Download the pretrained recognizer from:

Aster: https://github.com/ayumiymk/aster.pytorch  
MORAN:  https://github.com/Canjie-Luo/MORAN_v2  
CRNN: https://github.com/meijieru/crnn.pytorch

Unzip the codes and walk into the ' $TATT_ROOT$ /', place the pretrained weights from recognizer in ' $TATT_ROOT$ /'.

Download the TextZoom dataset:

https://github.com/JasonBoy1/TextZoom

Train the corresponding model (e.g. TPGSR-TSRN):

chmod a+x train_TATT.sh
./train_TATT.sh

Run the test-prefixed shell to test the corresponding model.

Adding '--go_test' in the shell file

Cite this paper:

@article{ma2021text,
title={A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution},
author={Ma, Jianqi and Zhetong, Liang and Zhang, Lei},
journal={},
year={2022}
}

A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution (CVPR2022)

Related tags

Overview

A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution (CVPR2022)

Recovering TextZoom samples

Environment:

Main idea

The pipeline

TP Interpreter

Configure your training

Download the pretrained recognizer from:

Download the TextZoom dataset:

Train the corresponding model (e.g. TPGSR-TSRN):

Run the test-prefixed shell to test the corresponding model.

Cite this paper:

Owner

MA Jianqi, shiki

On the model-based stochastic value gradient for continuous reinforcement learning

MLOps will help you to understand how to build a Continuous Integration and Continuous Delivery pipeline for an ML/AI project.

The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"

When are Iterative GPs Numerically Accurate?

Official repository for Fourier model that can generate periodic signals

Intel® Neural Compressor is an open-source Python library running on Intel CPUs and GPUs

Release of SPLASH: Dataset for semantic parse correction with natural language feedback in the context of text-to-SQL parsing

Clairvoyance: a Unified, End-to-End AutoML Pipeline for Medical Time Series

Unified unsupervised and semi-supervised domain adaptation network for cross-scenario face anti-spoofing, Pattern Recognition

SMCA replication There are no extra compiled components in SMCA DETR and package dependencies are minimal

Deep Inside Convolutional Networks - This is a caffe implementation to visualize the learnt model

Employee-Managment - Company employee registration software in the face recognition system

Model Zoo for AI Model Efficiency Toolkit

Low-code/No-code approach for deep learning inference on devices

FCOSR: A Simple Anchor-free Rotated Detector for Aerial Object Detection

LSTC: Boosting Atomic Action Detection with Long-Short-Term Context

Hack Camera, Microphone, Location, Clipboard With Just a Link. Also, Get Many Details About Victim's Device. And So On...

[CVPR 2021] Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach

Neighborhood Contrastive Learning for Novel Class Discovery

A Comprehensive Study on Learning-Based PE Malware Family Classification Methods