Black-Box-Tuning

Source code for the paper "Black-Box Tuning for Language-Model-as-a-Service".

I have been busy recently, so the code in this repo and this tutorial are very brief. Please let me know if you find any issues.

Prepare your environment

The implementation of Black-Box Tuning is quite simple: you can read our code and reimplement it in your own environment. Alternatively, create a new environment to run our implementation, which is based on Nevergrad, Transformers and FastNLP. We optionally use fitlog to monitor experimental results; to enable it, uncomment the fitlog-related lines in our code.

conda create --name bbt python=3.8
conda activate bbt
pip install transformers==4.1.1
pip install datasets
pip install fastNLP
pip install nevergrad
pip install scikit-learn
git clone https://github.com/txsun1997/Black-Box-Tuning
cd Black-Box-Tuning
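
After running the commands above, a quick check that the packages are importable can save a failed run later. The snippet below is a minimal sketch and not part of the repository; the helper name `missing_packages` and the `REQUIRED` list are hypothetical, and the import names are assumed to match the packages installed above (scikit-learn is imported as `sklearn`).

```python
from importlib.util import find_spec

# Import names assumed to correspond to the pip packages listed above.
REQUIRED = ["transformers", "datasets", "fastNLP", "nevergrad", "sklearn"]


def missing_packages(names):
    """Return the names that cannot be found on the current import path."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    print("missing:", ", ".join(missing) if missing else "none")
```

Run it inside the activated bbt environment; any name it reports as missing can be reinstalled with pip.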

Optimize your prompt without gradients

Now you can run Black-Box Tuning with run.sh:

bash run.sh

Results will be saved in a directory named results/. In general, you will obtain the following results:

SST-2 split	Best Accuracy (%)
Train	100
Dev	96.87
Test	88.19

To reproduce other experiments in our paper, change the arguments passed to bbt.py. For example:

python bbt.py --task_name "agnews" --n_prompt_tokens 50 --intrinsic_dim 500 --k_shot 16 --device "cuda:0" --seed 42 --loss_type "hinge" --cat_or_add "add" --budget 8000
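
To give a feel for what gradient-free prompt optimization involves, here is a toy sketch of the general idea: search over a small vector, map it to a larger prompt vector through a fixed random projection, and keep whatever lowers a loss that can only be queried, never differentiated. This is not the code in bbt.py. All function names are hypothetical, the quadratic `black_box_loss` stands in for a real model call, and a basic (1+1) evolution strategy is swapped in for the Nevergrad optimizer so that the sketch needs only the standard library. It also assumes that `--intrinsic_dim` corresponds to the size of the searched vector and `--budget` to the number of loss queries.

```python
import random


def make_projection(prompt_dim, intrinsic_dim, rng):
    """Fixed random matrix A (prompt_dim x intrinsic_dim); it is never trained."""
    scale = 1.0 / intrinsic_dim ** 0.5
    return [[rng.gauss(0.0, scale) for _ in range(intrinsic_dim)]
            for _ in range(prompt_dim)]


def project(A, z):
    """Map the low-dimensional vector z to a full prompt vector A @ z."""
    return [sum(a * zj for a, zj in zip(row, z)) for row in A]


def black_box_loss(prompt):
    """Toy stand-in for a model query: returns a scalar loss, no gradients."""
    return sum((p - 0.5) ** 2 for p in prompt)


def tune(budget=300, prompt_dim=20, intrinsic_dim=5, sigma=0.3, seed=42):
    """(1+1) evolution strategy: perturb z, keep the candidate if it is no worse."""
    rng = random.Random(seed)
    A = make_projection(prompt_dim, intrinsic_dim, rng)
    z = [0.0] * intrinsic_dim
    initial = best = black_box_loss(project(A, z))  # first query
    for _ in range(budget - 1):                     # remaining queries
        candidate = [zi + rng.gauss(0.0, sigma) for zi in z]
        loss = black_box_loss(project(A, candidate))
        if loss <= best:
            z, best = candidate, loss
    return z, initial, best


if __name__ == "__main__":
    z, initial, best = tune()
    print(f"loss before: {initial:.4f}, after: {best:.4f}")
```

Only `black_box_loss` touches the "model", so replacing it with a real inference call is the one change needed to move from the toy setting to an actual service, at the cost of one query per iteration.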

Cite

If you find this work helpful, please cite:

@article{sun2022bbt,
  title={Black-Box Tuning for Language-Model-as-a-Service},
  author={Tianxiang Sun and Yunfan Shao and Hong Qian and Xuanjing Huang and Xipeng Qiu},
  journal={arXiv preprint arXiv:2201.03514},
  year={2022}
}
