A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution (CVPR2022)

Last update: Jan 05, 2023

Jianqi Ma, Zhetong Liang, Lei Zhang
Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China & OPPO Research

Recovering TextZoom samples

Environment:

Other Python packages that may be needed: pyyaml, cv2 (installed as opencv-python), Pillow, and imgaug.
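A quick way to see which of the packages listed above are already installed is to look up their import modules. The snippet below is only a minimal sketch and is not part of the repository; the helper name `find_missing` and the mapping `OPTIONAL_MODULES` are made up for illustration. It relies only on the standard library.

```python
import importlib.util

# Package name as listed in this README -> module name used in `import`.
# The import name differs from the package name for some of them
# (pyyaml is imported as yaml, Pillow is imported as PIL).
OPTIONAL_MODULES = {
    "pyyaml": "yaml",
    "cv2": "cv2",
    "Pillow": "PIL",
    "imgaug": "imgaug",
}


def find_missing(modules=OPTIONAL_MODULES):
    """Return the package names whose import module cannot be located."""
    return [
        name
        for name, module in modules.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = find_missing()
    print("missing packages:", ", ".join(missing) if missing else "none")
```

Any name reported as missing can then be installed with pip before training.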

Main idea

The pipeline

TP Interpreter

Configure your training

Download the pretrained recognizer from:

Aster: https://github.com/ayumiymk/aster.pytorch  
MORAN:  https://github.com/Canjie-Luo/MORAN_v2  
CRNN: https://github.com/meijieru/crnn.pytorch

Unzip the code, change into '$TATT_ROOT$/', and place the pretrained recognizer weights in '$TATT_ROOT$/'.

Download the TextZoom dataset:

https://github.com/JasonBoy1/TextZoom

Train the corresponding model (e.g. TPGSR-TSRN):

chmod a+x train_TATT.sh
./train_TATT.sh

Run the test-prefixed shell script to test the corresponding model.

Alternatively, add '--go_test' to the command in the shell file.
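For readers unfamiliar with switch-style flags, the sketch below shows how a flag such as '--go_test' is commonly handled with Python's argparse. It is an assumption for illustration only: the repository's own argument parser may define the flag differently, and the names `build_parser` and `select_mode` are hypothetical.

```python
import argparse


def build_parser():
    parser = argparse.ArgumentParser(description="Illustrative train/test switch")
    # Assumption: '--go_test' acts as a boolean switch that is False unless given.
    parser.add_argument(
        "--go_test",
        action="store_true",
        help="run evaluation instead of training",
    )
    return parser


def select_mode(argv):
    """Return 'test' when --go_test is present in argv, otherwise 'train'."""
    args = build_parser().parse_args(argv)
    return "test" if args.go_test else "train"


if __name__ == "__main__":
    print(select_mode(["--go_test"]))
    print(select_mode([]))
```

With this pattern, appending the flag to the command line inside the shell file is enough to switch modes; no value needs to follow it.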

Cite this paper:

@article{ma2021text,
title={A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution},
author={Ma, Jianqi and Liang, Zhetong and Zhang, Lei},
journal={},
year={2022}
}

Owner

MA Jianqi, shiki
