Official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right"

Last update: Dec 23, 2022

Overview

Surface Form Competition

This is the official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right". We provide scripts for downloading and processing datasets and for reproducing our results on GPT-2 and GPT-3. We do not guarantee exact reproducibility, as library versions and GPUs may cause small differences, but these should be extremely minor.

Dependencies

We use Python 3 and PyTorch 1.7.0, but we do not use cutting-edge features from either and expect the code to be largely forward and backward compatible. That is not a guarantee or promise.

You can use pip install -r requirements.txt to install the required libraries.

OpenAI Beta

To use GPT-3 you must use OpenAI Beta, which is limited access. You can apply for access through OpenAI. Once you have access, you will need to point score.py to your API key with the --key argument, or put your key in api.key, which is the default path.
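The snippet below is a minimal sketch of the key-file convention described above, not the code in score.py. It assumes the key file is plain text containing only the key. The helper name read_api_key is hypothetical. It can be handy for checking that api.key is readable before starting a long run.

```python
from pathlib import Path
from typing import Optional

# Default location named in this README; everything else here is illustrative.
DEFAULT_KEY_PATH = "api.key"


def read_api_key(path: Optional[str] = None) -> str:
    """Hypothetical helper: return the API key stored in a plain-text file.

    Assumes the file contains only the key, possibly surrounded by whitespace.
    """
    key_file = Path(path or DEFAULT_KEY_PATH)
    if not key_file.is_file():
        raise FileNotFoundError(
            f"No key file at {key_file}; create it or pass a path via --key."
        )
    key = key_file.read_text(encoding="utf-8").strip()
    if not key:
        raise ValueError(f"Key file {key_file} is empty.")
    return key
```

Calling read_api_key() with no argument falls back to api.key in the current directory, mirroring the default path mentioned above.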

Downloading Datasets

DATA_README.md has thorough instructions for downloading and processing datasets. We provide automatic downloaders and processors for datasets where possible in data_downloaders/, but see DATA_README.md for full instructions.

Running Scorers

Once you have a dataset downloaded, running all the zero-shot scoring strategies at once is as simple as:

python score.py <dataset> --model <model>

where <dataset> is the abbreviation for a given dataset used for table rows in the paper. If there is any confusion, simply look in score.py to see how dataset selection works. <model> is the name of either a GPT-2 or GPT-3 model, e.g. xl, davinci, etc. To speed things up you can use a larger --batch if you have enough GPU memory.
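For scoring several models in a row, the command above can be assembled programmatically. The following is a minimal sketch under stated assumptions: build_score_command is a hypothetical helper, "<dataset>" is a placeholder for a real dataset abbreviation, and --batch is assumed to take an integer value. Only the --model, --batch and --key flags mentioned in this README are used.

```python
from typing import List, Optional


def build_score_command(
    dataset: str,
    model: str,
    batch: Optional[int] = None,
    key: Optional[str] = None,
) -> List[str]:
    """Hypothetical helper: build the argument list for one score.py run."""
    cmd = ["python", "score.py", dataset, "--model", model]
    if batch is not None:
        # Assumption: --batch expects an integer batch size.
        cmd += ["--batch", str(batch)]
    if key is not None:
        # Path to the API key file, only needed for GPT-3 models.
        cmd += ["--key", key]
    return cmd


if __name__ == "__main__":
    # "<dataset>" is a placeholder; substitute an abbreviation from score.py.
    for model in ("xl", "davinci"):
        print(" ".join(build_score_command("<dataset>", model)))
```

The returned list can be passed to subprocess.run from the repository root once the placeholder is replaced with a real dataset abbreviation.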

Owner

Peter West
