Code and dataset for the EMNLP 2021 Findings paper "Can NLI Models Verify QA Systems’ Predictions?"

Last update: Oct 21, 2022

Overview

Can NLI Models Verify QA Systems' Predictions?

This repository contains the data and code for the following paper:

**Can NLI Models Verify QA Systems' Predictions?**
Jifan Chen, Eunsol Choi, Greg Durrett
EMNLP 2021 Findings

@article{chen2021can,
  title={Can NLI Models Verify QA Systems' Predictions?},
  author={Chen, Jifan and Choi, Eunsol and Durrett, Greg},
  journal={EMNLP Findings},
  year={2021}
}

Datasets

The NLI data, converted from QA datasets with the pipeline described in the paper, can be found here.

Data Format

The data files are in JSON Lines format (one JSON object per line); each example has the following fields:

Field	Description
`example_id`	Example ID
`title_text`	Title of the Wikipedia page the example comes from; may be NONE
`paragraph_text`	Paragraph containing the answer
`question_text`	Question
`answer_text`	Answer of the question
`answer_sent_text`	Sentence containing the answer
`decontext_answer_sent_text`	Decontextualized sentence containing the answer
`question_statement_text`	Declarative statement formed by combining the question with the answer
`answer_scores`	Top-5 answer scores computed by the QA (BERT-joint) model
`is_correct`	Whether the answer is correct
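
As a rough illustration of the format above, the following sketch reads JSON Lines input and turns each example into an NLI-style triple. The helper names `parse_jsonl` and `to_nli_pair` are hypothetical and not part of this repository. Pairing `decontext_answer_sent_text` as the premise with `question_statement_text` as the hypothesis, and treating `is_correct` as a boolean-like label, are assumptions based on the field descriptions; check them against the paper and the released files before relying on them.

```python
import json
from typing import Iterable, Iterator, Tuple


def parse_jsonl(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one dict per non-empty line of JSON Lines input."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def to_nli_pair(example: dict) -> Tuple[str, str, bool]:
    """Build a (premise, hypothesis, label) triple from one example.

    Assumption: the decontextualized answer sentence acts as the premise,
    the question statement as the hypothesis, and `is_correct` is stored
    as a JSON boolean (or 0/1).
    """
    return (
        example["decontext_answer_sent_text"],
        example["question_statement_text"],
        bool(example["is_correct"]),
    )


if __name__ == "__main__":
    # Made-up record for illustration only; real files contain more fields.
    sample_line = json.dumps({
        "decontext_answer_sent_text": "Paris is the capital of France.",
        "question_statement_text": "The capital of France is Paris.",
        "is_correct": True,
    })
    for premise, hypothesis, label in map(to_nli_pair, parse_jsonl([sample_line])):
        print(premise, "|", hypothesis, "|", label)
```

To read a downloaded file, pass an open file handle instead of a list, for example `parse_jsonl(open(path, encoding="utf-8"))`, where `path` points to one of the data files.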

Models

Getting started

git clone https://github.com/jifan-chen/QA-Verification-Via-NLI.git

Install the dependencies by running `pip install -r requirements.txt`.

Question Converter & Decontextualizer

See the README in `seq2seq_converter`.

NQ-NLI

Coming soon.

Contact

If you have any questions, please contact the authors at [email protected].

Owner

Jifan Chen
