Official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right"

Last update: Dec 23, 2022

Related tags

Overview

Surface Form Competition

This is the official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right" We provide scripts for downloading/processing datasets and for reproducing our results on GPT-2 and GPT-3. We do not guarantee exact reproducibility, as library versions and GPUs may cause small differences, but these should be extremely minor.

Dependencies

We use python3 and pytorch 1.7.0, but we do not use cutting-edge features from either and expect to be largely forward and backward compatible. That is not a guarantee or promise.

You can use pip install -r requirements.txt to install the required libraries.

OpenAI Beta

To use GPT-3 you must use OpenAI Beta, which is limited access. You can apply for access here. Once you have access you will need to point the score.py to your API key with the --key argument or put your key in api.key which is the default path.

Downloading Datasets

DATA_README.md has thorough instructions for downloading and processing datasets. We provide automatic downloaders and processers for datasets where possible in data_downloaders/ but see DATA_README for full instructions.

Running Scorers

Once you have a dataset downloaded, running all the zero-shot scoring strategies at once is as simple as:

python score.py 
   
     --model

where is the abbreviation for a given dataset used for table rows in the paper. If there is any confusion, simply look in score.py to see how dataset selection works. is the name of either a GPT-2 or GPT-3 model e.g. xl, davinci, etc. To speed things up you can use a larger --batch if you have enough GPU memory.

Official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right"

Related tags

Overview

Surface Form Competition

Dependencies

OpenAI Beta

Downloading Datasets

Running Scorers

Owner

Peter West

Code for Understanding Pooling in Graph Neural Networks

Rayvens makes it possible for data scientists to access hundreds of data services within Ray with little effort.

Learning nonlinear operators via DeepONet

GLNet for Memory-Efficient Segmentation of Ultra-High Resolution Images

Deploy optimized transformer based models on Nvidia Triton server

The official implementation of "Rethink Dilated Convolution for Real-time Semantic Segmentation"

Code for the USENIX 2017 paper: kAFL: Hardware-Assisted Feedback Fuzzing for OS Kernels

Bayesian optimization in PyTorch

Band-Adaptive Spectral-Spatial Feature Learning Neural Network for Hyperspectral Image Classification

This repository is the official implementation of Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regularized Fine-Tuning (NeurIPS21).

MG-GCN: Scalable Multi-GPU GCN Training Framework

【ACMMM 2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning

Dynamic Environments with Deformable Objects (DEDO)

Pytorch implementation of four neural network based domain adaptation techniques: DeepCORAL, DDC, CDAN and CDAN+E. Evaluated on benchmark dataset Office31.

Uncertainty Estimation via Response Scaling for Pseudo-mask Noise Mitigation in Weakly-supervised Semantic Segmentation

An image processing project uses Viola-jones technique to detect faces and then use SIFT algorithm for recognition.

SCU OlympicsRunning Baseline

Classification Modeling: Probability of Default

Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm

Fuzzer for Linux Kernel Drivers