NaturalProofs: Mathematical Theorem Proving in Natural Language

Last update: Jan 05, 2023

Sean Welleck, Jiacheng Liu, Ronan Le Bras, Hannaneh Hajishirzi, Yejin Choi, Kyunghyun Cho

This repo contains:

The NaturalProofs Dataset and the mathematical reference retrieval task data.
Preprocessing NaturalProofs and the retrieval task data.
Training and evaluation for mathematical reference retrieval.
Pretrained models for mathematical reference retrieval.

Please cite our work if you find the resources in this repository useful:

@article{welleck2021naturalproofs,
  title={NaturalProofs: Mathematical Theorem Proving in Natural Language},
  author={Welleck, Sean and Liu, Jiacheng and Le Bras, Ronan and Hajishirzi, Hannaneh and Choi, Yejin and Cho, Kyunghyun},
  year={2021}
}

Section	Subsection
NaturalProofs Dataset	Dataset
	Preprocessing
Mathematical Reference Retrieval	Dataset
	Setup
	Preprocessing
	Pretrained models
	Training
	Evaluation

NaturalProofs Dataset

We provide the preprocessed NaturalProofs Dataset (JSON):

NaturalProofs Dataset
dataset.json [zenodo]
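
Once downloaded, the file can be inspected with the standard library. The sketch below is illustrative only: the helper names `load_dataset` and `top_level_summary` are hypothetical, and because the schema of dataset.json is not described here, the code only reports the top-level structure rather than assuming any particular keys.

```python
import json
from pathlib import Path


def load_dataset(path):
    """Read a JSON file (e.g. dataset.json) into Python objects."""
    with Path(path).open(encoding="utf-8") as f:
        return json.load(f)


def top_level_summary(data):
    """Map each top-level key to the type name of its value.

    No schema is assumed: a non-dict root is reported under '<root>'.
    """
    if isinstance(data, dict):
        return {key: type(value).__name__ for key, value in data.items()}
    return {"<root>": type(data).__name__}
```

For example, `top_level_summary(load_dataset("/path/to/dataset.json"))` gives a quick view of what the file contains before writing any code against it.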

Preprocessing

To recreate the NaturalProofs dataset.json from raw ProofWiki data:

Download the ProofWiki XML.
Preprocess the data using notebooks/parse_proofwiki.ipynb.
Form the data splits using notebooks/dataset_splits.ipynb.
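
As a rough illustration of the first preprocessing step, the sketch below streams pages out of a MediaWiki-style XML export. It is not the logic of notebooks/parse_proofwiki.ipynb, which is not shown here; the function names `local_name` and `iter_pages` are hypothetical, and the assumption that the ProofWiki XML follows the usual `page`/`title`/`text` export layout should be checked against the actual file.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Drop the '{namespace}' prefix that ElementTree adds to tag names."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(source):
    """Yield (title, text) for each <page> element in an XML export.

    `source` is a filename or a binary file object. Assumes a
    MediaWiki-style layout; this is an assumption, not a documented fact
    about the ProofWiki dump.
    """
    for _, elem in ET.iterparse(source, events=("end",)):
        if local_name(elem.tag) != "page":
            continue
        title, text = None, None
        for child in elem.iter():
            name = local_name(child.tag)
            if name == "title":
                title = child.text
            elif name == "text":
                text = child.text or ""
        yield title, text
        elem.clear()  # free memory for pages already processed
```

Streaming with `iterparse` keeps memory use low on large dumps, since each page is discarded after it is yielded.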

Mathematical Reference Retrieval

Dataset

The Mathematical Reference Retrieval dataset contains (x, r, y) examples with theorem statements x, positive and negative references r, and 0/1 labels y, derived from NaturalProofs.

We provide the version used in the paper (bert-base-cased tokenizer, 200 randomly sampled negatives):

Reference Retrieval Dataset
`bert-base-cased` 200 negatives
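
To make the (x, r, y) format concrete, here is a minimal sketch of pairing one theorem with its positive references and a fixed number of randomly sampled negatives. The names `RetrievalExample` and `build_examples` are hypothetical and the sampling details (uniform, without replacement, seeded) are assumptions; the provided dataset may have been constructed differently.

```python
import random
from dataclasses import dataclass


@dataclass
class RetrievalExample:
    x: str  # theorem statement
    r: str  # candidate reference
    y: int  # 1 if r is a true reference of x, otherwise 0


def build_examples(theorem, positives, all_refs, num_negatives=200, seed=0):
    """Return positive examples plus uniformly sampled negative examples."""
    rng = random.Random(seed)
    positive_set = set(positives)
    examples = [RetrievalExample(theorem, r, 1) for r in positives]
    # Negatives are references that are not positives for this theorem.
    pool = [r for r in all_refs if r not in positive_set]
    k = min(num_negatives, len(pool))  # guard against small reference sets
    for r in rng.sample(pool, k):
        examples.append(RetrievalExample(theorem, r, 0))
    return examples
```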

Pretrained Models

Pretrained models
`bert-base-cased`
`lstm`

These models were trained with the "bert-base-cased 200 negatives" dataset provided above.

Setup

python setup.py develop

See the DockerFile for additional version information.

Generating and tokenizing

To create your own version of the retrieval dataset, use python utils.py.

This step is not needed if you are using the reference retrieval dataset provided above.

Example:

python utils.py --filepath /path/to/dataset.json --output-path /path/to/out/ --model-type bert-base-cased
# => Writing dataset to /path/to/out/dataset_tokenized__bert-base-cased_200.pkl
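
The output is a pickle file, so it can be opened with Python's `pickle` module. The sketch below only loads the object and reports its type; the helper name `load_tokenized` is hypothetical and the internal layout of the pickle is not documented here, so nothing about its contents is assumed. Only unpickle files from a source you trust, since unpickling can execute arbitrary code.

```python
import pickle


def load_tokenized(path):
    """Load a pickled object, e.g. the tokenized retrieval dataset."""
    with open(path, "rb") as f:
        return pickle.load(f)


def root_type(obj):
    """Return the type name of the loaded object for a first inspection."""
    return type(obj).__name__
```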

Evaluation

Using the retrieval dataset and a model provided above, we compute the test evaluation metrics in the paper:

Predict the rankings:

# --method and --model-type each accept bert-base-cased or lstm.
# Use --split valid during model development.
python naturalproofs/predict.py \
--method bert-base-cased \
--model-type bert-base-cased \
--datapath /path/to/dataset_tokenized__bert-base-cased_200.pkl \
--datapath-base /path/to/dataset.json \
--checkpoint-path /path/to/best.ckpt \
--output-dir /path/to/out/ \
--split test

Compute metrics over the rankings:

# --method accepts bert-base-cased or lstm.
python naturalproofs/analyze.py \
--method bert-base-cased \
--eval-path /path/to/out/eval.pkl \
--analysis-path /path/to/out/analysis.pkl
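
For readers unfamiliar with ranking metrics, the sketch below shows two standard ones, recall@k and average precision, computed for a single theorem's ranked reference list. Whether analyze.py computes exactly these quantities, and in this form, is an assumption; the function names are illustrative and not taken from the repository.

```python
def recall_at_k(ranked, relevant, k):
    """Fraction of the relevant references that appear in the top k."""
    if not relevant:
        return 0.0
    return len(set(ranked[:k]) & set(relevant)) / len(relevant)


def average_precision(ranked, relevant):
    """Mean of precision values at each rank where a relevant item occurs."""
    relevant = set(relevant)
    if not relevant:
        return 0.0
    hits, total = 0, 0.0
    for rank, ref in enumerate(ranked, start=1):
        if ref in relevant:
            hits += 1
            total += hits / rank  # precision at this rank
    return total / len(relevant)
```

Averaging `average_precision` over all test theorems gives mean average precision (mAP).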

Training

python naturalproofs/model.py \
--datapath /path/to/dataset_tokenized__bert-base-cased_200.pkl \
--default-root-dir /path/to/out/

Classical Retrieval Baselines

TF-IDF example:

python naturalproofs/baselines.py \
--method tfidf \
--datapath /path/to/dataset_tokenized__bert-base-cased_200.pkl \
--datapath-base /path/to/dataset.json \
--savedir /path/to/out/

Then use analyze.py as shown above to compute metrics.
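
To illustrate what a TF-IDF retrieval baseline does, the following standard-library sketch ranks candidate references against a theorem statement by cosine similarity of TF-IDF vectors. It is not the implementation in baselines.py: the whitespace tokenizer, the smoothed IDF formula, and the function name `tfidf_rank` are all assumptions made for the example.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer (a simplification)."""
    return text.lower().split()


def tfidf_rank(query, references):
    """Return reference indices sorted by TF-IDF cosine similarity to query."""
    docs = [Counter(tokenize(r)) for r in references]
    n = len(docs)
    df = Counter()
    for doc in docs:
        df.update(doc.keys())  # document frequency of each term

    def idf(term):
        # Smoothed IDF, so terms unseen in the references still get a weight.
        return math.log((1 + n) / (1 + df[term])) + 1.0

    def vectorize(counts):
        return {t: c * idf(t) for t, c in counts.items()}

    def cosine(u, v):
        dot = sum(w * v.get(t, 0.0) for t, w in u.items())
        norm_u = math.sqrt(sum(w * w for w in u.values()))
        norm_v = math.sqrt(sum(w * w for w in v.values()))
        return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

    q = vectorize(Counter(tokenize(query)))
    scores = [cosine(q, vectorize(doc)) for doc in docs]
    return sorted(range(n), key=lambda i: scores[i], reverse=True)
```

The returned index order is a ranking of the kind that ranking metrics are then computed over.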
