Code and dataset for the EMNLP 2021 Finding paper "Can NLI Models Verify QA Systems’ Predictions?"

Last update: Oct 21, 2022

Related tags

Overview

Can NLI Models Verify QA Systems' Predictions?

This repository contains the data and code for the following paper:

**Can NLI Models Verify QA Systems' Predictions? **
Jifan Chen, Eunsol Choi, Greg Durrett
EMNLP 2021 Findings

@article{chen2021can,
  title={Can NLI Models Verify QA Systems' Predictions?},
  author={Chen, Jifan and Choi, Eunsol and Durrett, Greg},
  journal={EMNLP Findings},
  year={2021}
}

Datasets

The NLI data converted from QA datasets through our pipeline described in the paper can be found here

Data Format

The data files are formatted as jsonlines; each example is described as the following:

Field	Description
`example_id`	Example ID
`title_text`	Title of the Wikipedia page of the example, could be NONE
`paragraph_text`	Paragraph containing the answer
`question_text`	Question
`answer_text`	Answer of the question
`answer_sent_text`	Sentence containing the answer
`decontext_answer_sent_text`	Decontextualized sentence containing the answer
`question_statement_text`	Declarative version of the question by combining the answer
`answer_scores`	Top 5 Answer score computed by the QA(BERT-joint) model
`is_correct`	Whether the answer is correct
`answer_sent_text`	Sentence containing the answer

Models

Getting started

git clone https://github.com/jifan-chen/QA-Verification-Via-NLI.git

Install the dependencies by running pip install -r requirements.txt

Question Converter & Decontextualizer

See README in seq2seq_converter.

NQ-NLI

coming soon

Contact

Please contact at [email protected] if you have any questions.

Code and dataset for the EMNLP 2021 Finding paper "Can NLI Models Verify QA Systems’ Predictions?"

Related tags

Overview

Can NLI Models Verify QA Systems' Predictions?

Datasets

Data Format

Models

Getting started

Question Converter & Decontextualizer

NQ-NLI

Contact

Owner

Jifan Chen

Host your own GPT-3 Discord bot

An open source library for deep learning end-to-end dialog systems and chatbots.

Mkdocs + material + cool stuff

Google and Stanford University released a new pre-trained model called ELECTRA

Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"

BERT, LDA, and TFIDF based keyword extraction in Python

The code for two papers: Feedback Transformer and Expire-Span.

Unofficial Python library for using the Polish Wordnet (plWordNet / Słowosieć)

Kerberoast with ACL abuse capabilities

Longformer: The Long-Document Transformer

Multilingual text (NLP) processing toolkit

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

Wikipedia-Utils: Preprocessing Wikipedia Texts for NLP

jiant is an NLP toolkit

pyMorfologik MorfologikpyMorfologik - Python binding for Morfologik.

Twitter-Sentiment-Analysis - Twitter sentiment analysis for india's top online retailers(2019 to 2022)

A flask application to predict the speech emotion of any .wav file.

A website which allows you to play with the GPT-2 transformer

Ongoing research training transformer language models at scale, including: BERT & GPT-2

👑 spaCy building blocks and visualizers for Streamlit apps