Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"

Last update: Nov 14, 2022

Time-Sensitive-QA

This repo contains the dataset and code for the NeurIPS 2021 (dataset track) paper "A Dataset for Answering Time-Sensitive Questions". The dataset was collected by the UCSB NLP group and is released under the BSD 3-Clause "New" or "Revised" License.

This dataset is intended to test whether existing reading comprehension models can perform temporal reasoning, and whether they are sensitive to the temporal description in a given question. An example of annotated question-answer pairs is listed as follows:

Repo Structure

dataset/: contains all dataset files.
dataset/annotated*: the (passage, time-evolving facts) pairs annotated by crowd workers.
dataset/train-dev-test: the train, dev, and test files synthesized from templates, in both easy and hard versions.
BigBird/: all code for running the BigBird models.
FiD/: all code for running the Fusion-in-Decoder models.
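
As a quick sanity check of the layout above, the following minimal sketch lists the files under `dataset/` and separates those matching `annotated*` from the rest. The helper name `group_dataset_files` is hypothetical and not part of the repo, and treating every non-`annotated*` file as a synthesized split is an assumption, since the exact file names are not given here.

```python
from pathlib import Path


def group_dataset_files(root="dataset"):
    """Split the files directly under `root` into annotated and other files.

    Assumption: anything not matching `annotated*` belongs to the
    template-synthesized train/dev/test files.
    """
    root = Path(root)
    files = sorted(p for p in root.iterdir() if p.is_file())
    annotated = [p.name for p in files if p.name.startswith("annotated")]
    other = [p.name for p in files if not p.name.startswith("annotated")]
    return {"annotated": annotated, "other": other}


if __name__ == "__main__":
    # Only list files when run from the repo root, where dataset/ exists.
    if Path("dataset").is_dir():
        for group, names in group_dataset_files().items():
            print(group, names)
```

Run from the repo root, this prints the two groups so you can confirm which files are present before training.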

Requirements

BigBird-Specific Requirements

FiD-Specific Requirements

BigBird

Extractive QA baseline model. First switch to the BigBird Conda environment.

Initialize from NQ checkpoint

Running Training (Hard)

    python -m BigBird.main model_id=nq dataset=hard cuda=[DEVICE] mode=train per_gpu_train_batch_size=8

Running Evaluation (Hard)

    python -m BigBird.main model_id=nq dataset=hard cuda=[DEVICE] mode=eval model_path=[YOUR_MODEL]

Initialize from TriviaQA checkpoint

Running Training (Hard)

    python -m BigBird.main model_id=triviaqa dataset=hard cuda=[DEVICE] mode=train per_gpu_train_batch_size=2

Running Evaluation (Hard)

    python -m BigBird.main model_id=triviaqa dataset=hard mode=eval cuda=[DEVICE] model_path=[YOUR_MODEL]

Fusion-in-Decoder

Generative QA baseline model. First switch to the FiD Conda environment.

Initialize from NQ checkpoint

Running Training (Hard)

    python -m FiD.main mode=train dataset=hard model_path=/data2/wenhu/Time-Sensitive-QA/FiD/pretrained_models/nq_reader_base/

Running Evaluation (Hard)

    python -m FiD.main mode=eval cuda=[DEVICE] dataset=hard model_path=[YOUR_MODEL]

Running Evaluation on Human-Test (Hard)

    python -m FiD.main mode=eval cuda=[DEVICE] dataset=human_hard model_path=[YOUR_MODEL]

Initialize from TriviaQA checkpoint

Running Training (Hard)

    python -m FiD.main mode=train dataset=hard model_path=/data2/wenhu/Time-Sensitive-QA/FiD/pretrained_models/tqa_reader_base/

Running Evaluation (Hard)

    python -m FiD.main mode=eval cuda=[DEVICE] dataset=hard model_path=[YOUR_MODEL]

Running Evaluation on Human-Test (Hard)

    python -m FiD.main mode=eval cuda=[DEVICE] dataset=human_hard model_path=[YOUR_MODEL]
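
The BigBird and FiD commands above all share one shape: `python -m <module>` followed by `key=value` arguments. The minimal sketch below assembles a few of them from Python so that several runs can be scripted. The helper `build_command` is hypothetical and not part of the repo. It only prints the command lines, and `[DEVICE]` and `[YOUR_MODEL]` remain placeholders to fill in before running anything.

```python
def build_command(module, **overrides):
    """Return the argv list for `python -m <module> key=value ...`."""
    argv = ["python", "-m", module]
    # Keyword order is preserved, so arguments appear as written below.
    argv += [f"{key}={value}" for key, value in overrides.items()]
    return argv


# Argument names and values are copied from the commands in this README.
# [DEVICE] and [YOUR_MODEL] are placeholders, not real values.
RUNS = [
    build_command("BigBird.main", model_id="nq", dataset="hard",
                  cuda="[DEVICE]", mode="train", per_gpu_train_batch_size=8),
    build_command("BigBird.main", model_id="triviaqa", dataset="hard",
                  cuda="[DEVICE]", mode="train", per_gpu_train_batch_size=2),
    build_command("FiD.main", mode="eval", cuda="[DEVICE]",
                  dataset="hard", model_path="[YOUR_MODEL]"),
    build_command("FiD.main", mode="eval", cuda="[DEVICE]",
                  dataset="human_hard", model_path="[YOUR_MODEL]"),
]

for argv in RUNS:
    print(" ".join(argv))
```

Once the placeholders are replaced, each `argv` list could be passed to `subprocess.run(argv, check=True)` from the repo root to launch the run.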

License

The data and code are released under the BSD 3-Clause "New" or "Revised" License.

Report

Please create an issue or send an email to [email protected] for any questions/bugs/etc.

Owner

wenhu chen
