This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”

Last update: Dec 11, 2022

Related tags

Deep Learning prompt_semantics

Overview

This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”

Usage

To replicate our results in Section 4, run:

python3 prompt_tune.py \
    --save-dir ../runs/prompt_tuned_sec4/ \
    --prompt-path ../data/binary_NLI_prompts.csv \
    --experiment-name sec4 \
    --few-shots 3,5,10,20,30,50,100,250 \
    --production \
    --seeds 1

Add --fully-train if you want to train on the entire training set in addition to few-shot settings.

To replicate Section 5, run:

python3 prompt_tune.py \
    --save-dir ../runs/prompt_tuned_sec5/ \
    --prompt-path ../data/binary_NLI_prompts_permuted.csv \
    --experiment-name sec5 \
    --few-shots 3,5,10,20,30,50,100,250 \
    --production \
    --seeds 1

To get a fine-tuning baseline (Figure 1):

python3 fine_tune.py \
    --save-dir ../runs/fine_tune/ \
    --epochs 5 \
    --few-shots 3,5,10,20,30,50,100,250 \
    --fully-train \
    --production \
    --seeds 1

To replicate our exact results, use --seeds 1,2,3,4,5,6,7,8, which yields starting_example_index of 550,231,974,966,1046,2350,1326,928 respectively. This is important for ensuring that all models trained under the same seed always see exactly the same training examples. See paper Section 3 for more details.

If these seeds do not generate the same starting_example_index for you (which you can check in the output CSV files), you will have to manually specify the few-shot subset of training examples. I plan to add an argparse argument for this to make it easy.

All other hyperparameters are the same as the argparse default.

Miscellaneous Notes

You might notice that the code and output files are set up to produce a fine-grained analysis of HANS (McCoy et al., 2019). We actually run all of our main experiments on HANS as well and got similar results, which we plan to write up in a future version of our paper. Meanwhile, if you’re curious, feel free to add --do-diagnosis which will report the results on HANS.

Requirements

Python 3.9.

3.7 should mostly work too. You’d have to just replace the new built-in type hints and dictionary union operators with their older equivalents.

Activate your preferred virtual envrionment and then run pip install -r requirements.txt. If you want to replicate our exact results, use

torch==1.9.0+cu111
transformers==4.9.2
datasets==1.11.0

This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”

Related tags

Overview

Usage

Miscellaneous Notes

Requirements

Owner

Albert Webson

LibFewShot: A Comprehensive Library for Few-shot Learning.

Chainer Implementation of Fully Convolutional Networks. (Training code to reproduce the original result is available.)

TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition

A short and easy PyTorch implementation of E(n) Equivariant Graph Neural Networks

Pytorch implementation of Learning Rate Dropout.

ruptures: change point detection in Python

Official repository for the ISBI 2021 paper Transformer Assisted Convolutional Neural Network for Cell Instance Segmentation

DaReCzech is a dataset for text relevance ranking in Czech

A pytorch-based real-time segmentation model for autonomous driving

TuckER: Tensor Factorization for Knowledge Graph Completion

Source Code For Template-Based Named Entity Recognition Using BART

A project to make Amazon Echo respond to sign language using your webcam

🏆 The 1st Place Submission to AICity Challenge 2021 Natural Language-Based Vehicle Retrieval Track (Alibaba-UTS submission)

Optimize Trading Strategies Using Freqtrade

Repositorio oficial del curso IIC2233 Programación Avanzada 🚀✨

TCube generates rich and fluent narratives that describes the characteristics, trends, and anomalies of any time-series data (domain-agnostic) using the transfer learning capabilities of PLMs.

Tensorflow implementation of Semi-supervised Sequence Learning (https://arxiv.org/abs/1511.01432)

Official implementation of the paper "Topographic VAEs learn Equivariant Capsules"

DARTS-: Robustly Stepping out of Performance Collapse Without Indicators

Official implementation of the ICLR 2021 paper