Black-Box-Tuning

Source code for paper "Black-Box Tuning for Language-Model-as-a-Service".

Being busy recently, the code in this repo and this tutorial will be very brief. Please let me know if you find any issues.

Prepare your environment

The implementation of Black-Box Tuning is quite simple, you can check our code and easily implement it in your own environment. Or you can create a new environment to run our implementation, which is based on Nevergrad, Transformers and FastNLP. Optionally, we use fitlog to monitor experimental results. You can uncomment the fitlog-related lines in our code to use it.

conda create --name bbt python=3.8
conda activate bbt
pip install transformers==4.1.1
pip install datasets
pip install fastNLP
pip install nevergrad
pip install sklearn
git clone https://github.com/txsun1997/Black-Box-Tuning
cd Black-Box-Tuning

Optimize your prompt without gradients

Now you can run Black-Box Tuning with run.sh:

bash run.sh

Results will be saved in a directory named results/. In general, you will obtain the following results:

SST-2 split	Best Accuracy
Train	100
Dev	96.87
Test	88.19

To reproduce other experiments in our paper, change the arguments of bbt.py, for example,

python bbt.py --task_name "agnews" --n_prompt_tokens 50 --intrinsic_dim 500 --k_shot 16 --device "cuda:0" --seed 42 --loss_type "hinge" --cat_or_add "add" --budget 8000

Cite

If you find this work helpful, please cite:

@article{sun2022bbt,
  title={Black-Box Tuning for Language-Model-as-as-Service}, 
  author={Tianxiang Sun and Yunfan Shao and Hong Qian and Xuanjing Huang and Xipeng Qiu},
  journal={arXiv preprint arXiv:2201.03514},
  year={2022}
}

Black-Box-Tuning - Black-Box Tuning for Language-Model-as-a-Service

Related tags

Overview

Black-Box-Tuning

Prepare your environment

Optimize your prompt without gradients

Cite

Owner

Tianxiang Sun

Lane assist for ETS2, built with the ultra-fast-lane-detection model.

Python Classes: Medical Insurance Project using Object Oriented Programming Concepts

BiSeNet based on pytorch

[CVPR 2021] "Multimodal Motion Prediction with Stacked Transformers": official code implementation and project page.

Protect against subdomain takeover

E2VID_ROS - E2VID_ROS: E2VID to a real-time system

Unsupervised Attributed Multiplex Network Embedding (AAAI 2020)

pyspark🍒🥭 is delicious，just eat it!😋😋

Code and description for my BSc Project, September 2021

A benchmark dataset for mesh multi-label-classification based on cube engravings introduced in MeshCNN

An open source Python package for plasma science that is under development

Official implementation of "A Shared Representation for Photorealistic Driving Simulators" in PyTorch.

Lolviz - A simple Python data-structure visualization tool for lists of lists, lists, dictionaries; primarily for use in Jupyter notebooks / presentations

Contrastive Learning with Non-Semantic Negatives

[UNMAINTAINED] Automated machine learning for analytics & production

Graph Analysis From Scratch

Unofficial pytorch-lightning implement of Mip-NeRF

pytorch implementation of GPV-Pose

Monitor your ML jobs on mobile devices📱, especially for Google Colab / Kaggle

Cross-modal Retrieval using Transformer Encoder Reasoning Networks (TERN). With use of Metric Learning and FAISS for fast similarity search on GPU