This is a library for training and applying sparse fine-tunings with torch and transformers.

Last update: Dec 30, 2022

Related tags

Overview

This is a library for training and applying sparse fine-tunings with torch and transformers. Please refer to our paper Composable Sparse Fine-Tuning for Cross Lingual Transfer for background.

Installation

First, install Python 3.9 and PyTorch >= 1.9 (earlier versions may work but haven't been tested), e.g. using conda:

conda create -n sft python=3.9
conda activate sft
conda install pytorch cudatoolkit=11.1 -c pytorch -c conda-forge

Then download and install composable-sft:

git clone https://github.com/cambridgeltl/composable-sft.git
cd composable-sft
pip install -e .

Using pre-trained SFTs

Pre-trained SFTs can be downloaded directly and applied to models as follows:

from transformers import AutoConfig, AutoModelForTokenClassification
from sft import SFT

config = AutoConfig.from_pretrained(
    'bert-base-multilingual-cased',
    num_labels=17,
)

model = AutoModelForTokenClassification.from_pretrained(
    'bert-base-multilingual-cased',
    config=config,
)

language_sft = SFT('cambridgeltl/mbert-lang-sft-bxr-small') # SFT for Buryat
task_sft = SFT('cambridgeltl/mbert-task-sft-pos') # SFT for POS tagging

# Apply SFTs to pre-trained mBERT TokenClassification model
language_sft.apply(model)
task_sft.apply(model)

For a full list of pre-trained SFTs available, see MODELS

Example Scripts

Example scripts are provided in examples/ to show how to train SFTs using LT-SFT and evaluate them.

Citation

If you use this software, please cite the following paper:

@misc{ansell2021composable,
      title={Composable Sparse Fine-Tuning for Cross-Lingual Transfer},
      author={Alan Ansell and Edoardo Maria Ponti and Anna Korhonen and Ivan Vuli\'{c}},
      year={2021},
      eprint={2110.07560},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

This is a library for training and applying sparse fine-tunings with torch and transformers.

Related tags

Overview

Installation

Using pre-trained SFTs

Example Scripts

Citation

Owner

Cambridge Language Technology Lab

Multi-label classification of retinal disorders

Out-of-Distribution Generalization of Chest X-ray Using Risk Extrapolation

Lightweight tool to perform MITM attack on local network

arxiv-sanity, but very lite, simply providing the core value proposition of the ability to tag arxiv papers of interest and have the program recommend similar papers.

[NeurIPS 2021] Source code for the paper "Qu-ANTI-zation: Exploiting Neural Network Quantization for Achieving Adversarial Outcomes"

A library for low-memory inferencing in PyTorch.

A map update dataset and benchmark

Pytorch implementations of the paper Value Functions Factorization with Latent State Information Sharing in Decentralized Multi-Agent Policy Gradients

UT-Sarulab MOS prediction system using SSL models

Centroid-UNet is deep neural network model to detect centroids from satellite images.

CR-FIQA: Face Image Quality Assessment by Learning Sample Relative Classifiability

Code for EMNLP'21 paper "Types of Out-of-Distribution Texts and How to Detect Them"

Contrastive Feature Loss for Image Prediction

Custom IMDB Dataset is extracted between 2020-2021 and custom distilBERT model is trained for movie success probability prediction

Speech recognition tool to convert audio to text transcripts, for Linux and Raspberry Pi.

Joint parameterization and fitting of stroke clusters

Transformer Tracking (CVPR2021)

PhysCap: Physically Plausible Monocular 3D Motion Capture in Real Time

Graph Transformer Architecture. Source code for

Implementation of the ivis algorithm as described in the paper Structure-preserving visualisation of high dimensional single-cell datasets.