[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction

Last update: Jan 06, 2023

Overview

REval

Introduction
Overview
Requirements
Installation
Probing
Usage
Citation
License

🎓 Introduction

REval is a simple framework for probing sentence-level representations of Relation Extraction models.

✅ Requirements

REval is tested with:

Python 3.7

🚀 Installation

With pip

<TBD>

From source

git clone https://github.com/DFKI-NLP/REval
cd REval
pip install -r requirements.txt

🔬 Probing

Supported Datasets

SemEval 2010 Task 8 (CoreNLP annotated version) [LINK]
TACRED (obtained via LDC) [LINK]

Probing Tasks

Task	SemEval 2010	TACRED
ArgTypeHead	✔️	✔️
ArgTypeTail	✔️	✔️
Length	✔️	✔️
EntityDistance	✔️	✔️
ArgumentOrder		✔️
EntityExistsBetweenHeadTail	✔️	✔️
PosTagHeadLeft	✔️	✔️
PosTagHeadRight	✔️	✔️
PosTagTailLeft	✔️	✔️
PosTagTailRight	✔️	✔️
TreeDepth	✔️	✔️
SDPTreeDepth	✔️	✔️
ArgumentHeadGrammaticalRole	✔️	✔️
ArgumentTailGrammaticalRole	✔️	✔️

🔧 Usage

Step 1: create the probing task datasets from the original datasets.

SemEval 2010 Task 8

python reval.py generate-all-from-semeval \
    --train-file <SEMEVAL DIR>/train.json \
    --validation-file <SEMEVAL DIR>/dev.json \
    --test-file <SEMEVAL DIR>/test.json \
    --output-dir ./data/semeval/

TACRED

python reval.py generate-all-from-tacred \
    --train-file <TACRED DIR>/train.json \
    --validation-file <TACRED DIR>/dev.json \
    --test-file <TACRED DIR>/test.json \
    --output-dir ./data/tacred/

Step 2: Run the probing tasks on a model.

For example, download a Relation Extraction model trained with RelEx, e.g., the CNN trained on SemEval.

mkdir -p models/cnn_semeval
wget --content-disposition https://cloud.dfki.de/owncloud/index.php/s/F3gf9xkeb2foTFe/download -P models/cnn_semeval

python probing_task_evaluation.py \
    --model-dir ./models/cnn_semeval/ \
    --data-dir ./data/semeval/ \
    --dataset semeval2010 \
    --cuda-device 0 \
    --batch-size 64 \
    --cache-representations

After the run is completed, the results are stored to probing_task_results.json in the model-dir.

{
    "ArgTypeHead": {
        "acc": 75.82,
        "devacc": 78.96,
        "ndev": 670,
        "ntest": 2283
    },
    "ArgTypeTail": {
        "acc": 75.4,
        "devacc": 78.79,
        "ndev": 627,
        "ntest": 2130
    },
    [...]
}

📚 Citation

If you use REval, please consider citing the following paper:

@inproceedings{alt-etal-2020-probing,
    title={Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction},
    author={Christoph Alt and Aleksandra Gabryszak and Leonhard Hennig},
    year={2020},
    booktitle={Proceedings of ACL},
    url={https://arxiv.org/abs/2004.08134}
}

📘 License

REval is released under the terms of the MIT License.

[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction

Related tags

Overview

REval

Table of Contents

🎓 Introduction

✅ Requirements

🚀 Installation

With pip

From source

🔬 Probing

Supported Datasets

Probing Tasks

🔧 Usage

Step 1: create the probing task datasets from the original datasets.

SemEval 2010 Task 8

TACRED

Step 2: Run the probing tasks on a model.

📚 Citation

📘 License

Owner

Example repository for custom C++/CUDA operators for TorchScript

Simultaneous Detection and Segmentation

classify fashion-mnist dataset with pytorch

Process text, including tokenizing and representing sentences as vectors and Applying some concepts like RNN, LSTM and GRU to create a classifier can detect the language in which a sentence is written from among 17 languages.

Learning to Reconstruct 3D Manhattan Wireframes from a Single Image

Official code release for: EditGAN: High-Precision Semantic Image Editing

A repo for Causal Imitation Learning under Temporally Correlated Noise

An implementation of EWC with PyTorch

Compositional and Parameter-Efficient Representations for Large Knowledge Graphs

A fast python implementation of Ray Tracing in One Weekend using python and Taichi

TensorFlow GNN is a library to build Graph Neural Networks on the TensorFlow platform.

某学校选课系统GIF验证码数据集 + Baseline模型 + 上下游相关工具

LaneDet is an open source lane detection toolbox based on PyTorch that aims to pull together a wide variety of state-of-the-art lane detection models

Seasonal Contrast: Unsupervised Pre-Training from Uncurated Remote Sensing Data

Morphable Detector for Object Detection on Demand

Activity image-based video retrieval

A Vision Transformer approach that uses concatenated query and reference images to learn the relationship between query and reference images directly.

Interactive Image Generation via Generative Adversarial Networks

Supplementary code for the AISTATS 2021 paper "Matern Gaussian Processes on Graphs".

Learning trajectory representations using self-supervision and programmatic supervision.