Database Reasoning Over Text project for ACL paper

Last update: Dec 12, 2022

Related tags

Overview

Database Reasoning over Text

This repository contains the code for the Database Reasoning Over Text paper, to appear at ACL2021. Work is performed in collaboration with James Thorne, Majid Yazdani, Marzieh Saeidi, Fabrizio Silvestri, Sebastian Riedel, and Alon Halevy.

Data

The completed NeuralDB datasets can be downloaded here and are released under a CC BY-SA 3.0 license.

The dataset includes entity names from Wikidata which are released under a CC BY-SA 3.0 license. This dataset includes sentences from the KELM corpus. KELM is released under the CC BY-SA 2.0 license

Repository Structure

The repository is structured in 3 sub-folders:

Tools for mapping the KELM data to Wikidata identifiers are provided in the dataset construction folder ,
The information retrieval system for the support set generator are provided in the ssg folder
The models for Neural SPJ, the baseline retrieval (TF-IDF and DPR), and evaluation scripts are provided in the modelling folder.

Instructions for running each component are provided in the README files in the respective sub-folders.

Setup

All sub-folders were set up with one Python environment per folder. Requirements for each environment can be installed by running a pip install:

pip install -r requirements.txt

In the dataset-construction and modelling folders, the src folder should be included in the python path.

export PYTHONPATH=src

License

The code in this repository is released under the Apache 2.0 license

Database Reasoning Over Text project for ACL paper

Related tags

Overview

Database Reasoning over Text

Data

Repository Structure

Setup

License

Owner

Facebook Research

Stacked Hourglass Network with a Multi-level Attention Mechanism: Where to Look for Intervertebral Disc Labeling

CausalNLP is a practical toolkit for causal inference with text as treatment, outcome, or "controlled-for" variable.

MEND: Model Editing Networks using Gradient Decomposition

Detecting Blurred Ground-based Sky/Cloud Images

Learning to Estimate Hidden Motions with Global Motion Aggregation

State-to-Distribution (STD) Model

[NeurIPS'20] Self-supervised Co-Training for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.

🌾 PASTIS 🌾 Panoptic Agricultural Satellite TIme Series

A FAIR dataset of TCV experimental results for validating edge/divertor turbulence models.

This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-clustering.

A Large-Scale Dataset for Spinal Vertebrae Segmentation in Computed Tomography

Implementation supporting the ICCV 2017 paper "GANs for Biological Image Synthesis"

Vector Quantized Diffusion Model for Text-to-Image Synthesis

Gym for multi-agent reinforcement learning

SciPy fixes and extensions

Official Implementation of VAT

A Pytorch implementation of "Manifold Matching via Deep Metric Learning for Generative Modeling" (ICCV 2021)

Utilizes Pose Estimation to offer sprinters cues based on an image of their running form.

Here is the implementation of our paper S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations.

Code and models for "Rethinking Deep Image Prior for Denoising" (ICCV 2021)