Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".

Last update: Jan 04, 2023

Overview

Patience-based Early Exit

NEWS: We now have a better and tidier implementation integrated into Hugging Face transformers!

Citation

If you use this code in your research, please cite our paper:

@inproceedings{zhou2020bert,
 author = {Zhou, Wangchunshu and Xu, Canwen and Ge, Tao and McAuley, Julian and Xu, Ke and Wei, Furu},
 booktitle = {Advances in Neural Information Processing Systems},
 pages = {18330--18341},
 publisher = {Curran Associates, Inc.},
 title = {BERT Loses Patience: Fast and Robust Inference with Early Exit},
 url = {https://proceedings.neurips.cc/paper/2020/file/d4dd111a4fd973394238aca5c05bebe3-Paper.pdf},
 volume = {33},
 year = {2020}
}

Requirement

Our code is built on huggingface/transformers. To use our code, you must clone and install huggingface/transformers.

Training

You can fine-tune a pretrained language model and train the internal classifiers by configuring and running finetune_bert.sh or finetune_albert.sh.
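To illustrate what training the internal classifiers involves, here is a minimal sketch of a layer-weighted training loss. It assumes, following our reading of the paper, that each layer's classifier contributes a cross-entropy loss weighted by its depth. The function names and the plain-Python cross-entropy are hypothetical and are not taken from the fine-tuning scripts.

```python
import math
from typing import Sequence


def cross_entropy(logits: Sequence[float], target: int) -> float:
    """Cross-entropy of one example, computed with a stable log-sum-exp."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_sum_exp - logits[target]


def weighted_internal_loss(
    layer_logits: Sequence[Sequence[float]], target: int
) -> float:
    """Depth-weighted average of the per-layer classifier losses.

    layer_logits[j - 1] holds the logits of the classifier on layer j.
    Assumption: layer j is weighted by j, so deeper classifiers count more.
    """
    if not layer_logits:
        raise ValueError("need logits from at least one layer")
    total, weight_sum = 0.0, 0
    for depth, logits in enumerate(layer_logits, start=1):
        total += depth * cross_entropy(logits, target)
        weight_sum += depth
    return total / weight_sum


# Two layers: the first is undecided, the second is confident and correct.
loss = weighted_internal_loss([[0.0, 0.0], [100.0, 0.0]], target=0)
print(round(loss, 4))
```

The actual scripts may compute this differently, so treat the sketch only as a reading aid.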

Inference

You can run inference with different patience settings by configuring and running patience_infer_bert.sh or patience_infer_albert.sh.
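As a rough illustration of what the patience setting controls, the sketch below applies a patience-based exit rule to the per-layer predictions of a single example: inference stops once the predicted label has stayed unchanged for `patience` consecutive layers. The function name and the counting convention (the counter starts at 0 and resets whenever the label changes) are assumptions for illustration and may differ from the scripts.

```python
from typing import Iterable, Sequence, Tuple


def patience_early_exit(
    layer_logits: Iterable[Sequence[float]], patience: int
) -> Tuple[int, int]:
    """Return (predicted_label, layers_used) for one example.

    layer_logits yields the logits of each layer's internal classifier,
    shallowest first. It may be a lazy generator, so layers after the
    exit point are never computed.
    """
    if patience < 1:
        raise ValueError("patience must be at least 1")
    previous = None   # label predicted by the previous layer
    counter = 0       # consecutive layers that agreed with `previous`
    depth = 0
    for depth, logits in enumerate(layer_logits, start=1):
        label = max(range(len(logits)), key=lambda i: logits[i])
        if label == previous:
            counter += 1
        else:
            counter = 0
            previous = label
        if counter >= patience:
            return label, depth  # early exit
    if previous is None:
        raise ValueError("need logits from at least one layer")
    return previous, depth       # no exit triggered: use the last layer


# Per-layer predicted labels here are 1, 0, 0, 0, 1.
logits = [[0.1, 0.9], [0.8, 0.2], [0.7, 0.3], [0.9, 0.1], [0.2, 0.8]]
print(patience_early_exit(logits, patience=1))  # exits at layer 3
print(patience_early_exit(logits, patience=2))  # exits at layer 4
```

A larger patience value uses more layers before exiting, trading speed for more agreement among the internal classifiers.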

Bug Report and Contribution

If you'd like to contribute by adding more tasks (only GLUE is available at the moment), please submit a pull request and contact me. If you find a problem or bug, please open an issue. Thanks!

Owner

Kevin Canwen Xu
