Fastseq

A text generation acceleration framework based on ONNX Runtime

1. Environment setup

# Create the onnx conda environment
conda create -n onnx_py38 python=3.8
conda activate onnx_py38
conda install pytorch cudatoolkit=10.2 -c pytorch

# Install onnxruntime-gpu (so far only version 1.5.2 has been tested successfully)
pip install onnxruntime-gpu==1.5.2

# Install transformers version 3.1.0
pip install transformers==3.1.0
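
Since only one version combination is reported to work, it can help to confirm the pins before going further. The following is a minimal sketch, not part of the project: `check_versions` and `EXPECTED` are hypothetical names introduced here, and the script only reads installed package metadata from the standard library.

```python
from importlib import metadata

# Versions pinned in the setup steps above.
EXPECTED = {
    "onnxruntime-gpu": "1.5.2",
    "transformers": "3.1.0",
}


def check_versions(expected):
    """Return {package: (wanted, found)} for each missing or mismatched package."""
    problems = {}
    for name, wanted in expected.items():
        try:
            found = metadata.version(name)
        except metadata.PackageNotFoundError:
            found = None  # package is not installed in this environment
        if found != wanted:
            problems[name] = (wanted, found)
    return problems


if __name__ == "__main__":
    # Prints nothing when every pinned package matches.
    for name, (wanted, found) in check_versions(EXPECTED).items():
        print(f"{name}: expected {wanted}, found {found or 'not installed'}")
```

Running it inside the activated `onnx_py38` environment should print nothing when both pins match.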

2. ONNX conversion

# Convert a model/checkpoint saved with Hugging Face to ONNX format, using the conversion tool bundled with onnxruntime.
python -m onnxruntime.transformers.convert_to_onnx \
    -m "path_to_checkpoint/model_name(gpt2)" \
    --model_class GPT2LMHeadModel \
    --output gpt2_fp32.onnx \
    -p fp32

3. Demo test

CUDA_VISIBLE_DEVICES=3 python demo.py \
    --onnx_model_path "./gpt2_fp32.onnx" \
    --model_name_or_path "path_to_checkpoint" \
    --prompt_text "here is an example of gpt2 model" \
    --do_sample_top_k 5
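
The `--do_sample_top_k 5` flag presumably enables top-k sampling with k = 5, although `demo.py` itself is not shown here. As an illustration of that technique only, and not the project's implementation, here is a minimal pure-Python sketch; `sample_top_k` is a hypothetical helper name.

```python
import math
import random


def sample_top_k(logits, k, rng=random):
    """Sample one token id from the k highest-scoring entries of `logits`."""
    if k <= 0:
        raise ValueError("k must be positive")
    # Indices of the k largest logits.
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    # Softmax weights over the kept logits only (subtract the max for stability).
    peak = max(logits[i] for i in top)
    weights = [math.exp(logits[i] - peak) for i in top]
    # Draw one index in proportion to its weight.
    return rng.choices(top, weights=weights, k=1)[0]
```

With k = 1 this reduces to greedy decoding. Larger k values admit more candidate tokens at each step, trading determinism for variety.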


Owner

Jun Gao
