Fastseq

基于ONNXRUNTIME的文本生成加速框架

1. 环境配置

# 创建onnx conda环境
conda create -n onnx_py38 python=3.8
conda activate onnx_py38
conda install pytorch cudatoolkit=10.2 -c pytorch

# 安装onnxruntime-gpu(目前只有1.5.2版本测试成功)
pip install onnxruntime-gpu==1.5.2

# 安装transformers==3.1.0版本
pip install transformers==3.1.0

2. ONNX转换

# 将huggingface保存的 模型/checkpoint 转换为onnx格式。这里使用onnxruntime自带的转换工具。
python -m onnxruntime.transformers.convert_to_onnx \
    -m "path_to_checkpoint/model_name(gpt2)" \
    --model_class GPT2LMHeadModel \
    --output gpt2_fp32.onnx \
    -p fp32

3. DEMO测试

CUDA_VISIBLE_DEVICES=3 python demo.py \
    --onnx_model_path "./gpt2_fp32.onnx" \
    --model_name_or_path "path_to_checkpoint" \
    --prompt_text "here is an example of gpt2 model" \
    --do_sample_top_k 5

Fastseq 基于ONNXRUNTIME的文本生成加速框架

Related tags

Overview

Fastseq

1. 环境配置

2. ONNX转换

3. DEMO测试

4. TODO

Owner

Jun Gao

Legal text retrieval for python

A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/casual, active/passive, and many more. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.

A notebook that shows how to import the IITB English-Hindi Parallel Corpus from the HuggingFace datasets repository

code for "AttentiveNAS Improving Neural Architecture Search via Attentive Sampling"

TTS is a library for advanced Text-to-Speech generation.

A highly sophisticated sequence-to-sequence model for code generation

用Resnet101+GPT搭建一个玩王者荣耀的AI

Autoregressive Entity Retrieval

Just a Basic like Language for Zeno INC

Score-Based Point Cloud Denoising (ICCV'21)

Baseline code for Korean open domain question answering(ODQA)

AI-powered literature discovery and review engine for medical/scientific papers

Gpt2-WebAPI - The objective of this API is to provide the 3 best possible responses to sentences that the user would input via http GET request as a parameter

Türkçe küfürlü içerikleri bulan bir yapay zeka kütüphanesi / An ML library for profanity detection in Turkish sentences

Code-autocomplete, a code completion plugin for Python

Paradigm Shift in NLP - "Paradigm Shift in Natural Language Processing".

A Fast Sequence Transducer Implementation with PyTorch Bindings

Unsupervised Document Expansion for Information Retrieval with Stochastic Text Generation

MRC approach for Aspect-based Sentiment Analysis (ABSA)

Under the hood working of transformers, fine-tuning GPT-3 models, DeBERTa, vision models, and the start of Metaverse, using a variety of NLP platforms: Hugging Face, OpenAI API, Trax, and AllenNLP