A high-level yet extensible library for fast language model tuning via automatic prompt search

Last update: Dec 07, 2022

Related tags

Overview

ruPrompts

ruPrompts is a high-level yet extensible library for fast language model tuning via automatic prompt search, featuring integration with HuggingFace Hub, configuration system powered by Hydra, and command line interface.

Prompt is a text instruction for language model, like

Translate English to French:
cat =>

For some tasks the prompt is obvious, but for some it isn't. With ruPrompts you can define only the prompt format, like {text}, and train it automatically for any task, if you have a training dataset.

You can currently use ruPrompts for text-to-text tasks, such as summarization, detoxification, style transfer, etc., and for styled text generation, as a special case of text-to-text.

Features

Modular structure for convenient extensibility
Integration with HF Transformers, support for all models with LM head
Integration with HF Hub for sharing and loading pretrained prompts
CLI and configuration system powered by Hydra
Pretrained prompts for ruGPT-3

Installation

ruPrompts can be installed with pip:

pip install ruprompts[hydra]

See Installation for other installation options.

Usage

Loading a pretrained prompt for styled text generation:

>> ppln_joke("Говорит кружка ложке") [{"generated_text": 'Говорит кружка ложке: "Не бойся, не утонешь!".'}]">

>>> import ruprompts
>>> from transformers import pipeline

>>> ppln_joke = pipeline("text-generation-with-prompt", prompt="konodyuk/prompt_rugpt3large_joke")
>>> ppln_joke("Говорит кружка ложке")
[{"generated_text": 'Говорит кружка ложке: "Не бойся, не утонешь!".'}]

For text2text tasks:

>> ppln_detox("Опять эти тупые дятлы все испортили, чтоб их черти взяли") [{"generated_text": 'Опять эти люди все испортили'}]">

>>> ppln_detox = pipeline("text2text-generation-with-prompt", prompt="konodyuk/prompt_rugpt3large_detox_russe")
>>> ppln_detox("Опять эти тупые дятлы все испортили, чтоб их черти взяли")
[{"generated_text": 'Опять эти люди все испортили'}]

Proceed to Quick Start for a more detailed introduction or start using ruPrompts right now with our Colab Tutorials.

License

ruPrompts is Apache 2.0 licensed. See the LICENSE file for details.

A high-level yet extensible library for fast language model tuning via automatic prompt search

Related tags

Overview

ruPrompts

Features

Installation

Usage

License

Owner

Sber AI

Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.

A Paper List for Speech Translation

Document processing using transformers

In this workshop we will be exploring NLP state of the art transformers, with SOTA models like T5 and BERT, then build a model using HugginFace transformers framework.

초성 해석기 based on ko-BART

official ( API ) for the zAmericanEnglish app in [ Google play ] and [ App store ]

Kurumi ChatBot

Telegram AI chat bot written in Python using Pyrogram

Multilingual text (NLP) processing toolkit

The projects lets you extract glossary words and their definitions from a given piece of text automatically using NLP techniques

Simple GUI where you can enter an article and get a crisp summarized version.

A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP

LSTM based Sentiment Classification using Tensorflow - Amazon Reviews Rating

Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch

A Flask Sentiment Analysis API, with visual implementation

Course project of [email protected]

Weird Sort-and-Compress Thing

Constituency Tree Labeling Tool

SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples

BERN2: an advanced neural biomedical namedentity recognition and normalization tool