Contrastive Fact Verification

Last update: Dec 19, 2022

Related tags

Overview

VitaminC

This repository contains the dataset and models for the NAACL 2021 paper: Get Your Vitamin C! Robust Fact Verification with Contrastive Evidence. The VitaminC dataset contains more than 450,000 claim-evidence pairs from over 100,000 revisions to popular Wikipedia pages, and additional "synthetic" revisions.

We're still updating this repo. More to come soon. Please reach out to us if you have any questions.

Below are instructions for the four main tasks described in the paper:

Revision Flagging
Fact Verification
Word-level Rationales
Factually Consistent Generation

Install

If you're only interested in the dataset (in jsonlines format), please find the per-task links below.

To install this pacakage with the code to process the dataset and run transformer models and baselines, run:

python setup.py install

Note: python>=3.7 is needed for all the dependencies to work.

Revision Flagging

VitaminC revision flagging data (the script below will automatically download it): link

Example of evaluating ALBERT-base model on the test dataset:

sh scripts/run_flagging.sh

The BOW and edit distance baselines from the paper are in scripts/factual_flagging_baselines.py.

Fact Verification

VitaminC fact verification data (the script below will automatically download it): link

Example of evaluating ALBERT-base model fine-tuned with VitaminC and FEVER datasets on the "real" and "synthetic" test sets of VitaminC:

sh scripts/run_fact_verification.sh

To evaluate the same model on another jsonlines file (containing claim, evidence, and label fields). Use:

sh scripts/run_fact_verification.sh path_to_test_file

Other available pretrained models (including the ALBERT-xlarge model that performed the best):

tals/albert-base-vitaminc
tals/albert-base-vitaminc-mnli
tals/albert-base-vitaminc-fever
tals/albert-xlarge-vitaminc
tals/albert-xlarge-vitaminc-mnli
tals/albert-xlarge-vitaminc-fever

Word-level Rationales

Will be added soon

Factually Consistent Generation

Will be added soon

Citation

If you find our code and/or data useful, please cite our paper:

@InProceedings{Schuster2019,
    title = "Get Your Vitamin C! Robust Fact Verification with Contrastive Evidence",
    author="Tal Schuster and Adam Fisch and Regina Barzilay",
    booktitle = "NAACL 2021",
    year = "2021",
    url = "https://arxiv.org/abs/2103.08541",
}

Contrastive Fact Verification

Related tags

Overview

VitaminC

Install

Revision Flagging

Fact Verification

Word-level Rationales

Factually Consistent Generation

Citation

Owner

Code for CoMatch: Semi-supervised Learning with Contrastive Graph Regularization

High-Resolution 3D Human Digitization from A Single Image.

Pytorch implementation for RelTransformer

ThunderGBM: Fast GBDTs and Random Forests on GPUs

Recurrent Scale Approximation (RSA) for Object Detection

Inference pipeline for our participation in the FeTA challenge 2021.

This is the official code of our paper "Diversity-based Trajectory and Goal Selection with Hindsight Experience Relay" (PRICAI 2021)

Image-to-Image Translation in PyTorch

LightSeq is a high performance training and inference library for sequence processing and generation implemented in CUDA

Learning Neural Network Subspaces

A new test set for ImageNet

Just-Now - This Is Just Now Login Friendlist Cloner Tools

ICON: Implicit Clothed humans Obtained from Normals (CVPR 2022)

Keras implementations of Generative Adversarial Networks.

This Jupyter notebook shows one way to implement a simple first-order low-pass filter on sampled data in discrete time.

Intelligent Video Analytics toolkit based on different inference backends.

Synthesizing Long-Term 3D Human Motion and Interaction in 3D in CVPR2021

Control-Robot-Arm-using-PS4-Controller - A Robotic Arm based on Raspberry Pi and Arduino that controlled by PS4 Controller

Neurons Dataset API - The official dataloader and visualization tools for Neurons Datasets.

This is a code repository for the paper "Graph Auto-Encoders for Financial Clustering".