A demo for end-to-end English and Chinese text spotting using ABCNet.

Last update: Oct 04, 2022

Related tags

Overview

ABCNet_Chinese

A demo for end-to-end English and Chinese text spotting using ABCNet. This is an old model that was trained a long ago, which serves as a base setting for others to train their own model on Chinese or other language. Official ABCNet_v2 models will be updated in AdelaiDet.

Installation

Install detectron2 using the provided version (support visualizing Chinese text):

python -m pip install -e d2

Install this repo:

python setup.py build develop

If the above succeed, you can now run the demo using the provided model.

Model

This is our model that can be used for evaluation or pretraining.

wget https://drive.google.com/file/d/1iWX2n_BmyltVwQmfj8_oM9z7cJlq1P0m/view?usp=sharing -O model_chn.pth

Simply put the model in the root directory of the repo.

Demo

bash demo.sh

Example results

If you successfully run the demo, you will get the output below:

Other results (same project but not using the provide model):

Document-like Ancient words, e.g., “彝文”:

Cite

If you find this repo useful, please cite:

@article{liu2021abcnet,
  title={ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text Spotting},
  author={Liu, Yuliang and Shen, Chunhua and Jin, Lianwen and He, Tong and Chen, Peng and Liu, Chongyu and Chen, Hao},
  journal={arXiv preprint arXiv:2105.03620},
  year={2021}
}

Data

We provide the converted json files of ArT, LSVT, and ReCTS that we have used for training ABCNet_Chinese.

ReCTs [images&label](1.7G) [Origin_of_dataset]
LSVT [images&label](8.2G) [Origin_of_dataset]
ArT [images&label](1.5G) [Origin_of_dataset]
SynChinese130k [images&label](25G) [Origin_of_dataset]

License

For academic use, this project is licensed under the 2-clause BSD License - see the LICENSE file for details. For commercial use, please contact Chunhua Shen.

A demo for end-to-end English and Chinese text spotting using ABCNet.

Related tags

Overview

ABCNet_Chinese

Installation

Model

Demo

Example results

Cite

Data

License

Owner

Yuliang Liu

2021海华AI挑战赛·中文阅读理解·技术组·第三名

Code for papers "Generation-Augmented Retrieval for Open-Domain Question Answering" and "Reader-Guided Passage Reranking for Open-Domain Question Answering", ACL 2021

Rich Prosody Diversity Modelling with Phone-level Mixture Density Network

⚖️ A Statutory Article Retrieval Dataset in French.

A relatively simple python program to generate one of those reddit text to speech videos dominating youtube.

Indobenchmark are collections of Natural Language Understanding (IndoNLU) and Natural Language Generation (IndoNLG)

Build Text Rerankers with Deep Language Models

This repository details the steps in creating a Part of Speech tagger using Trigram Hidden Markov Models and the Viterbi Algorithm without using external libraries.

Idea is to build a model which will take keywords as inputs and generate sentences as outputs.

Code for the paper "Language Models are Unsupervised Multitask Learners"

Generating new names based on trends in data using GPT2 (Transformer network)

A library for Multilingual Unsupervised or Supervised word Embeddings

Easy-to-use CPM for Chinese text generation

Adversarial Examples for Extreme Multilabel Text Classification

Module for automatic summarization of text documents and HTML pages.

Nested Named Entity Recognition

An open-source NLP research library, built on PyTorch.

📝An easy-to-use package to restore punctuation of the text.

Multilingual text (NLP) processing toolkit

Journey is a NLP-Powered Developer assistant