This is a library for training and applying sparse fine-tunings with torch and transformers.

Last update: Dec 30, 2022

Related tags

Overview

This is a library for training and applying sparse fine-tunings with torch and transformers. Please refer to our paper Composable Sparse Fine-Tuning for Cross Lingual Transfer for background.

Installation

First, install Python 3.9 and PyTorch >= 1.9 (earlier versions may work but haven't been tested), e.g. using conda:

conda create -n sft python=3.9
conda activate sft
conda install pytorch cudatoolkit=11.1 -c pytorch -c conda-forge

Then download and install composable-sft:

git clone https://github.com/cambridgeltl/composable-sft.git
cd composable-sft
pip install -e .

Using pre-trained SFTs

Pre-trained SFTs can be downloaded directly and applied to models as follows:

from transformers import AutoConfig, AutoModelForTokenClassification
from sft import SFT

config = AutoConfig.from_pretrained(
    'bert-base-multilingual-cased',
    num_labels=17,
)

model = AutoModelForTokenClassification.from_pretrained(
    'bert-base-multilingual-cased',
    config=config,
)

language_sft = SFT('cambridgeltl/mbert-lang-sft-bxr-small') # SFT for Buryat
task_sft = SFT('cambridgeltl/mbert-task-sft-pos') # SFT for POS tagging

# Apply SFTs to pre-trained mBERT TokenClassification model
language_sft.apply(model)
task_sft.apply(model)

For a full list of pre-trained SFTs available, see MODELS

Example Scripts

Example scripts are provided in examples/ to show how to train SFTs using LT-SFT and evaluate them.

Citation

If you use this software, please cite the following paper:

@misc{ansell2021composable,
      title={Composable Sparse Fine-Tuning for Cross-Lingual Transfer},
      author={Alan Ansell and Edoardo Maria Ponti and Anna Korhonen and Ivan Vuli\'{c}},
      year={2021},
      eprint={2110.07560},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

This is a library for training and applying sparse fine-tunings with torch and transformers.

Related tags

Overview

Installation

Using pre-trained SFTs

Example Scripts

Citation

Owner

Cambridge Language Technology Lab

Tutorials and implementations for "Self-normalizing networks"

《K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters》(2020)

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)

Pytorch code for paper "Image Compressed Sensing Using Non-local Neural Network" TMM 2021.

Source Code for ICSE 2022 Paper - ``Can We Achieve Fairness Using Semi-Supervised Learning?''

Joint deep network for feature line detection and description

Codes for "Template-free Prompt Tuning for Few-shot NER".

Multi-tool reverse engineering collaboration solution.

mlpack: a scalable C++ machine learning library --

PyTorch ,ONNX and TensorRT implementation of YOLOv4

a Lightweight library for sequential learning agents, including reinforcement learning

This is the official code of L2G, Unrolling and Recurrent Unrolling in Learning to Learn Graph Topologies.

PPO is a very popular Reinforcement Learning algorithm at present.

PyTorch inference for "Progressive Growing of GANs" with CelebA snapshot

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

RoboDesk A Multi-Task Reinforcement Learning Benchmark

Official codebase used to develop Vision Transformer, MLP-Mixer, LiT and more.

Pytorch implementation of winner from VQA Chllange Workshop in CVPR'17

The official github repository for Towards Continual Knowledge Learning of Language Models