NaturalProofs: Mathematical Theorem Proving in Natural Language

Last update: Jan 05, 2023

Sean Welleck, Jiacheng Liu, Ronan Le Bras, Hannaneh Hajishirzi, Yejin Choi, Kyunghyun Cho

This repo contains:

The NaturalProofs Dataset and the mathematical reference retrieval task data.
Preprocessing NaturalProofs and the retrieval task data.
Training and evaluation for mathematical reference retrieval.
Pretrained models for mathematical reference retrieval.

Please cite our work if you find the resources in this repository useful:

@article{welleck2021naturalproofs,
  title={NaturalProofs: Mathematical Theorem Proving in Natural Language},
  author={Welleck, Sean and Liu, Jiacheng and Le Bras, Ronan and Hajishirzi, Hannaneh and Choi, Yejin and Cho, Kyunghyun},
  year={2021}
}

Section	Subsection
NaturalProofs Dataset	Dataset
	Preprocessing
Mathematical Reference Retrieval	Dataset
	Setup
	Preprocessing
	Pretrained models
	Training
	Evaluation

NaturalProofs Dataset

We provide the preprocessed NaturalProofs Dataset (JSON):

NaturalProofs Dataset
dataset.json [zenodo]
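
The layout of dataset.json is not described here, so a reasonable first step after downloading is to inspect its shape. The following is a minimal sketch that makes no assumption about the schema; the helper names `describe` and `describe_file` are hypothetical and not part of this repository.

```python
import json


def describe(node, depth=0, max_depth=2):
    """Return indented lines summarizing the shape of a parsed JSON value."""
    pad = "  " * depth
    if isinstance(node, dict):
        lines = [f"{pad}dict ({len(node)} keys)"]
        if depth < max_depth:
            for key, value in node.items():
                child = describe(value, depth + 1, max_depth)
                lines.append(f"{pad}  {key!r}: {child[0].strip()}")
                lines.extend(child[1:])
        return lines
    if isinstance(node, list):
        lines = [f"{pad}list ({len(node)} items)"]
        if node and depth < max_depth:
            # Only the first item is sampled, assuming the list is homogeneous.
            child = describe(node[0], depth + 1, max_depth)
            lines.append(f"{pad}  [0]: {child[0].strip()}")
            lines.extend(child[1:])
        return lines
    return [f"{pad}{type(node).__name__}"]


def describe_file(path, max_depth=2):
    """Load a JSON file and summarize its top levels."""
    with open(path, encoding="utf-8") as f:
        return describe(json.load(f), max_depth=max_depth)
```

Calling `print("\n".join(describe_file("/path/to/dataset.json")))` lists the top-level keys and the type of the first item of each list, which is usually enough to decide how to index into the data.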

Preprocessing

The NaturalProofs dataset.json was created from raw ProofWiki data with the following steps:

Download the ProofWiki XML.
Preprocess the data using notebooks/parse_proofwiki.ipynb.
Form the data splits using notebooks/dataset_splits.ipynb.

Mathematical Reference Retrieval

Dataset

The Mathematical Reference Retrieval dataset contains (x, r, y) examples with theorem statements x, positive and negative references r, and 0/1 labels y, derived from NaturalProofs.
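
To make the (x, r, y) format concrete, here is a minimal sketch of how such examples could be formed for a single theorem: each positive reference gets label 1, and randomly sampled non-positive references get label 0. The function `build_examples` and its parameters are hypothetical; this is not the code in utils.py, which additionally tokenizes the text.

```python
import random


def build_examples(theorem, positive_refs, all_refs, num_negatives=200, seed=0):
    """Pair one theorem statement x with references r and 0/1 labels y."""
    rng = random.Random(seed)
    # Deduplicate positives while keeping their original order.
    positives = list(dict.fromkeys(positive_refs))
    positive_set = set(positives)
    # Negatives are drawn only from references the proof does not use.
    candidates = [r for r in all_refs if r not in positive_set]
    negatives = rng.sample(candidates, min(num_negatives, len(candidates)))
    examples = [(theorem, r, 1) for r in positives]
    examples += [(theorem, r, 0) for r in negatives]
    return examples
```

With `num_negatives=200`, a theorem with three positive references would yield 203 examples, provided at least 200 other references are available to sample from.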

We provide the version used in the paper (bert-base-cased tokenizer, 200 randomly sampled negatives):

Reference Retrieval Dataset
`bert-base-cased` 200 negatives

Pretrained Models

Pretrained models
`bert-base-cased`
`lstm`

These models were trained with the "bert-base-cased 200 negatives" dataset provided above.

Setup

python setup.py develop

See the DockerFile for additional version information.

Generating and tokenizing

To create your own version of the retrieval dataset, use python utils.py.

This step is not needed if you are using the reference retrieval dataset provided above.

Example:

python utils.py --filepath /path/to/dataset.json --output-path /path/to/out/ --model-type bert-base-cased
# => Writing dataset to /path/to/out/dataset_tokenized__bert-base-cased_200.pkl

Evaluation

Using the retrieval dataset and a model provided above, we compute the test evaluation metrics in the paper:

Predict the rankings:

# For the LSTM model, pass lstm to both --method and --model-type.
# Use --split valid during model development.
python naturalproofs/predict.py \
--method bert-base-cased \
--model-type bert-base-cased \
--datapath /path/to/dataset_tokenized__bert-base-cased_200.pkl \
--datapath-base /path/to/dataset.json \
--checkpoint-path /path/to/best.ckpt \
--output-dir /path/to/out/ \
--split test

Compute metrics over the rankings:

# For the LSTM model, pass lstm to --method.
python naturalproofs/analyze.py \
--method bert-base-cased \
--eval-path /path/to/out/eval.pkl \
--analysis-path /path/to/out/analysis.pkl
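
For readers unfamiliar with ranking metrics, the sketch below shows two that are commonly reported for retrieval: mean average precision and recall@k. It is an illustration under the assumption that each theorem has a ranked list of reference ids and a set of ground-truth reference ids; the function names are hypothetical, and the exact metrics and data layout used by analyze.py are not specified here.

```python
def average_precision(ranked, relevant):
    """Average of precision values at each rank where a relevant item appears."""
    relevant = set(relevant)
    if not relevant:
        return 0.0
    hits = 0
    total = 0.0
    for rank, ref in enumerate(ranked, start=1):
        if ref in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant)


def recall_at_k(ranked, relevant, k):
    """Fraction of relevant items that appear in the top k of the ranking."""
    relevant = set(relevant)
    if not relevant:
        return 0.0
    return len(relevant.intersection(ranked[:k])) / len(relevant)


def mean_average_precision(rankings, relevants):
    """Mean of average_precision over all theorems."""
    scores = [average_precision(r, rel) for r, rel in zip(rankings, relevants)]
    return sum(scores) / len(scores) if scores else 0.0
```

For example, with the ranking `["r3", "r1", "r7", "r2"]` and ground truth `{"r1", "r2"}`, the relevant items appear at ranks 2 and 4, so the average precision is (1/2 + 2/4) / 2 = 0.5.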

Training

python naturalproofs/model.py \
--datapath /path/to/dataset_tokenized__bert-base-cased_200.pkl \
--default-root-dir /path/to/out/

Classical Retrieval Baselines

TF-IDF example:

python naturalproofs/baselines.py \
--method tfidf \
--datapath /path/to/dataset_tokenized__bert-base-cased_200.pkl \
--datapath-base /path/to/dataset.json \
--savedir /path/to/out/

Then use analyze.py as shown above to compute metrics.
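
As background on what a TF-IDF baseline does, the following standard-library sketch ranks candidate references by the cosine similarity between TF-IDF vectors of the theorem statement and of each reference. The whitespace tokenizer, the smoothed IDF formula, and the function names are assumptions made for illustration; baselines.py may differ in all of these.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()


def tfidf_rank(query, references):
    """Return reference indices sorted by TF-IDF cosine similarity to query."""
    docs = [Counter(tokenize(r)) for r in references]
    n = len(docs)
    # Document frequency: the number of references containing each term.
    df = Counter(term for doc in docs for term in doc)
    # Smoothed inverse document frequency.
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}

    def vectorize(counts):
        # Terms unseen in the references are dropped from the vector.
        vec = {t: tf * idf[t] for t, tf in counts.items() if t in idf}
        norm = math.sqrt(sum(w * w for w in vec.values()))
        return {t: w / norm for t, w in vec.items()} if norm else {}

    q = vectorize(Counter(tokenize(query)))
    scores = []
    for doc in docs:
        d = vectorize(doc)
        scores.append(sum(w * d.get(t, 0.0) for t, w in q.items()))
    return sorted(range(n), key=lambda i: scores[i], reverse=True)
```

The returned index list plays the same role as a model's predicted ranking: the first index is the reference judged most similar to the theorem statement.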

Owner

Sean Welleck