Demo BERT ONNX pipeline written in rust

This demo showcase the use of onnxruntime-rs with a GPU on CUDA 11 to run Bert in a data pipeline with Rust.

Requirement

Linux x86_64
NVIDIA GPU with CUDA 11 (Not sure if CUDA 10 works)
Rust (obviously)
git lfs. Install it with https://git-lfs.github.com/

training and converting to ONNX (Python)

Installation

Download the data https://www.kaggle.com/c/predict-closed-questions-on-stack-overflow/data?select=train_October_9_2012.csv -> train_October_9_2012.csv and put it in the folder training.

cd training
pip install -r requirement.txt

Run

python pytorch-model.py

I have not done any parameter search to find an optimal model as it is not the point of this repo.

I have limited the training to 500 rows to iterate faster.

Inference using Python

cd src
python python_alternative.py

Inference using Rust

Installation

export ORT_USE_CUDA=1
cargo build --release

Run

cargo run --release

or after build:

export LD_LIBRARY_PATH=path/to/onnxruntime-linux-x64-gpu-1.8.0/lib:${LD_LIBRARY_PATH}
./target/release/machine-learning

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
src		src
training		training
.gitattributes		.gitattributes
.gitignore		.gitignore
Cargo.toml		Cargo.toml
README.md		README.md
grace_hopper.jpg		grace_hopper.jpg
medium.csv		medium.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

src

src

training

training

.gitattributes

.gitattributes

.gitignore

.gitignore

Cargo.toml

Cargo.toml

README.md

README.md

grace_hopper.jpg

grace_hopper.jpg

medium.csv

medium.csv

Repository files navigation

Demo BERT ONNX pipeline written in rust

Requirement

training and converting to ONNX (Python)

Installation

Run

Inference using Python

Inference using Rust

Installation

Run

About

Releases

Packages

Contributors 2

Languages

haixuanTao/bert-onnx-rs-pipeline

Folders and files

Latest commit

History

Repository files navigation

Demo BERT ONNX pipeline written in rust

Requirement

training and converting to ONNX (Python)

Installation

Run

Inference using Python

Inference using Rust

Installation

Run

About

Resources

Stars

Watchers

Forks

Languages