poutyne-transformers

Train 🤗 -transformers models with Poutyne.

Installation

pip install poutyne-transformers

Example

import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer
from datasets import load_dataset
from torch.utils.data import DataLoader
from torch import optim
from poutyne import Model
from poutyne_transformers import TransformerCollator, model_loss, ModelWrapper

print('Loading model & tokenizer.')
transformer = AutoModelForSequenceClassification.from_pretrained('distilbert-base-cased', num_labels=2, return_dict=True)
tokenizer = AutoTokenizer.from_pretrained('distilbert-base-cased')

print('Loading & preparing dataset.')
dataset = load_dataset("imdb")
dataset = dataset.map(lambda entry: tokenizer(entry['text'], add_special_tokens=True, padding='max_length', truncation=True), batched=True)
dataset = dataset.remove_columns(['text'])
dataset.set_format('torch')

collate_fn = TransformerCollator()
train_dataloader = DataLoader(dataset['train'], batch_size=16, collate_fn=collate_fn)
test_dataloader = DataLoader(dataset['test'], batch_size=16, collate_fn=collate_fn)

print('Preparing training.')
wrapped_transformer = ModelWrapper(transformer)
optimizer = optim.AdamW(wrapped_transformer.parameters(), lr=5e-5)
device = torch.device('cuda:0' if torch.cuda.is_available() else "cpu")
model = Model(wrapped_transformer, optimizer, loss_function=model_loss, device=device)

print('Starting training.')
model.fit_generator(train_dataloader, test_dataloader, epochs=1)

Train 🤗-transformers model with Poutyne.

Related tags

Overview

poutyne-transformers

Installation

Example

Owner

Lennart Keller

This repository contains the code for "Generating Datasets with Pretrained Language Models".

The FinQA dataset from paper: FinQA: A Dataset of Numerical Reasoning over Financial Data

Code-autocomplete, a code completion plugin for Python

Write Python in Urdu - اردو میں کوڈ لکھیں

This repository contains the code, models and datasets discussed in our paper "Few-Shot Question Answering by Pretraining Span Selection"

Yet Another Neural Machine Translation Toolkit

Contact Extraction with Question Answering.

An attempt to map the areas with active conflict in Ukraine using open source twitter data.

The (extremely) naive sentiment classification function based on NBSVM trained on wisesight_sentiment

Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021)

IMDB film review sentiment classification based on BERT's supervised learning model.

Sentello is python script that simulates the anti-evasion and anti-analysis techniques used by malware.

An easier way to build neural search on the cloud

A crowdsourced dataset of dialogues grounded in social contexts involving utilization of commonsense.

A fast, efficient universal vector embedding utility package.

Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together

txtai: Build AI-powered semantic search applications in Go

The RWKV Language Model

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

This is a simple item2vec implementation using gensim for recbole