Styled Handwritten Text Generation with Transformers (ICCV 21)

Last update: Dec 22, 2022

Overview

⚡ Handwriting Transformers [PDF]

Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan & Mubarak Shah

Abstract: We propose a novel transformer-based styled handwritten text image generation approach, HWT, that strives to learn both style-content entanglement as well as global and local writing style patterns. The proposed HWT captures the long and short range relationships within the style examples through a self-attention mechanism, thereby encoding both global and local style patterns. Further, the proposed transformer-based HWT comprises an encoder-decoder attention that enables style-content entanglement by gathering the style representation of each query character. To the best of our knowledge, we are the first to introduce a transformer-based generative network for styled handwritten text generation. Our proposed HWT generates realistic styled handwritten text images and significantly outperforms the state-of-the-art demonstrated through extensive qualitative, quantitative and human-based evaluations. The proposed HWT can handle arbitrary length of text and any desired writing style in a few-shot setting. Further, our HWT generalizes well to the challenging scenario where both words and writing style are unseen during training, generating realistic styled handwritten text images.

Software environment

Python 3.7
PyTorch >=1.4

Setup & Training

Please see INSTALL.md for installing required libraries. You can change the content in the file mytext.txt to visualize generated handwriting while training.

Citation

If you use the code for your research, please cite our paper:

@InProceedings{Bhunia_2021_ICCV,
    author    = {Bhunia, Ankan Kumar and Khan, Salman and Cholakkal, Hisham and Anwer, Rao Muhammad and Khan, Fahad Shahbaz and Shah, Mubarak},
    title     = {Handwriting Transformers},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {1086-1094}
}

Styled Handwritten Text Generation with Transformers (ICCV 21)

Related tags

Overview

⚡ Handwriting Transformers [PDF]

Software environment

Setup & Training

Citation

Owner

Ankan Kumar Bhunia

Markov Attention Models

Bounding Wasserstein distance with couplings

《Geo Word Clouds》paper implementation

Towards the D-Optimal Online Experiment Design for Recommender Selection (KDD 2021)

BMN: Boundary-Matching Network

Deep Learning Models for Causal Inference

Galaxy images labelled by morphology (shape). Aimed at ML development and teaching

The NEOSSat is a dual-mission microsatellite designed to detect potentially hazardous Earth-orbit-crossing asteroids and track objects that reside in deep space

Catalyst.Detection

Demystifying How Self-Supervised Features Improve Training from Noisy Labels

source code for https://arxiv.org/abs/2005.11248 "Accelerating Antimicrobial Discovery with Controllable Deep Generative Models and Molecular Dynamics"

Predict Breast Cancer Wisconsin (Diagnostic) using Naive Bayes

[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime

MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera

Face and Pose detector that emits MQTT events when a face or human body is detected and not detected.

The open source code of SA-UNet: Spatial Attention U-Net for Retinal Vessel Segmentation.

Storchastic is a PyTorch library for stochastic gradient estimation in Deep Learning

Point detection through multi-instance deep heatmap regression for sutures in endoscopy

SegNet model implemented using keras framework

In this project we combine techniques from neural voice cloning and musical instrument synthesis to achieve good results from as little as 16 seconds of target data.