[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction

Last update: Jan 06, 2023

REval

Introduction
Overview
Requirements
Installation
Probing
Usage
Citation
License

🎓 Introduction

REval is a simple framework for probing sentence-level representations of Relation Extraction models.

✅ Requirements

REval is tested with:

Python 3.7

🚀 Installation

With pip

<TBD>

From source

git clone https://github.com/DFKI-NLP/REval
cd REval
pip install -r requirements.txt

🔬 Probing

Supported Datasets

SemEval 2010 Task 8 (CoreNLP annotated version) [LINK]
TACRED (obtained via LDC) [LINK]

Probing Tasks

Task	SemEval 2010	TACRED
ArgTypeHead	✔️	✔️
ArgTypeTail	✔️	✔️
Length	✔️	✔️
EntityDistance	✔️	✔️
ArgumentOrder		✔️
EntityExistsBetweenHeadTail	✔️	✔️
PosTagHeadLeft	✔️	✔️
PosTagHeadRight	✔️	✔️
PosTagTailLeft	✔️	✔️
PosTagTailRight	✔️	✔️
TreeDepth	✔️	✔️
SDPTreeDepth	✔️	✔️
ArgumentHeadGrammaticalRole	✔️	✔️
ArgumentTailGrammaticalRole	✔️	✔️
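
To give a feel for what the surface-level tasks in the table measure, here is a minimal sketch that derives labels for Length, EntityDistance, and ArgumentOrder from one tokenized sentence. The function name, the `(start, end)` span convention, and the label strings are assumptions made for illustration. REval's own generators may define, bin, or name these labels differently.

```python
from typing import Dict, List, Tuple

# Hypothetical span convention: (start, end) token indices, end exclusive.
Span = Tuple[int, int]


def probing_labels(tokens: List[str], head: Span, tail: Span) -> Dict[str, object]:
    """Derive illustrative labels for three surface-level probing tasks."""
    head_first = head[0] < tail[0]
    left, right = (head, tail) if head_first else (tail, head)
    return {
        # Number of tokens in the sentence (unbinned in this sketch).
        "Length": len(tokens),
        # Number of tokens strictly between the two argument spans.
        "EntityDistance": max(0, right[0] - left[1]),
        # Whether the head argument precedes the tail argument.
        "ArgumentOrder": "head_first" if head_first else "tail_first",
    }


example = probing_labels(
    ["The", "engine", "was", "built", "by", "the", "factory", "."],
    head=(1, 2),  # "engine"
    tail=(6, 7),  # "factory"
)
print(example)
```

A probing classifier is then trained to predict such a label from the model's sentence representation alone.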

🔧 Usage

Step 1: Create the probing task datasets from the original datasets.

SemEval 2010 Task 8

python reval.py generate-all-from-semeval \
    --train-file <SEMEVAL DIR>/train.json \
    --validation-file <SEMEVAL DIR>/dev.json \
    --test-file <SEMEVAL DIR>/test.json \
    --output-dir ./data/semeval/

TACRED

python reval.py generate-all-from-tacred \
    --train-file <TACRED DIR>/train.json \
    --validation-file <TACRED DIR>/dev.json \
    --test-file <TACRED DIR>/test.json \
    --output-dir ./data/tacred/
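
When both datasets are generated from a script, the two commands above can be assembled programmatically. The sketch below only builds the argument list. The helper name `build_generate_command` and the example paths are hypothetical, while the subcommands, flags, and split file names are the ones shown above.

```python
import sys
from pathlib import Path
from typing import List

# Subcommands exactly as used in the two commands above.
SUBCOMMANDS = {
    "semeval": "generate-all-from-semeval",
    "tacred": "generate-all-from-tacred",
}


def build_generate_command(dataset: str, data_dir: str, output_dir: str) -> List[str]:
    """Build the reval.py argument list for one dataset (hypothetical helper)."""
    data = Path(data_dir)
    return [
        sys.executable, "reval.py", SUBCOMMANDS[dataset],
        "--train-file", (data / "train.json").as_posix(),
        "--validation-file", (data / "dev.json").as_posix(),
        "--test-file", (data / "test.json").as_posix(),
        "--output-dir", output_dir,
    ]


# Example paths are placeholders, not locations shipped with REval.
cmd = build_generate_command("tacred", "corpora/tacred", "./data/tacred/")
print(" ".join(cmd[1:]))
```

Passing the list to `subprocess.run(cmd, check=True)` from the repository root would execute it and raise an error if the generation step fails.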

Step 2: Run the probing tasks on a model.

For example, download a Relation Extraction model trained with RelEx, such as the CNN trained on SemEval:

mkdir -p models/cnn_semeval
wget --content-disposition https://cloud.dfki.de/owncloud/index.php/s/F3gf9xkeb2foTFe/download -P models/cnn_semeval

python probing_task_evaluation.py \
    --model-dir ./models/cnn_semeval/ \
    --data-dir ./data/semeval/ \
    --dataset semeval2010 \
    --cuda-device 0 \
    --batch-size 64 \
    --cache-representations

After the run completes, the results are written to probing_task_results.json in the directory passed as --model-dir:

{
    "ArgTypeHead": {
        "acc": 75.82,
        "devacc": 78.96,
        "ndev": 670,
        "ntest": 2283
    },
    "ArgTypeTail": {
        "acc": 75.4,
        "devacc": 78.79,
        "ndev": 627,
        "ntest": 2130
    },
    [...]
}
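
The results file is easy to post-process. As a minimal sketch, the snippet below ranks tasks by test accuracy, assuming the file is a JSON object keyed by task name with the `acc` and `devacc` fields shown above. The helper names `load_results` and `rank_tasks` are hypothetical and not part of REval.

```python
import json
from pathlib import Path
from typing import Dict, List, Tuple

Results = Dict[str, Dict[str, float]]


def load_results(model_dir: str) -> Results:
    """Read probing_task_results.json from a model directory."""
    path = Path(model_dir) / "probing_task_results.json"
    with path.open(encoding="utf-8") as f:
        return json.load(f)


def rank_tasks(results: Results) -> List[Tuple[str, float, float]]:
    """Return (task, test accuracy, dev accuracy) rows, best test accuracy first."""
    rows = [(task, m["acc"], m["devacc"]) for task, m in results.items()]
    return sorted(rows, key=lambda row: row[1], reverse=True)


# Demo on the two entries shown above; for a real run, use
# load_results("./models/cnn_semeval/") instead.
sample = {
    "ArgTypeHead": {"acc": 75.82, "devacc": 78.96, "ndev": 670, "ntest": 2283},
    "ArgTypeTail": {"acc": 75.4, "devacc": 78.79, "ndev": 627, "ntest": 2130},
}
ranked = rank_tasks(sample)
for task, acc, devacc in ranked:
    print(f"{task:<12} test acc {acc:6.2f}  dev acc {devacc:6.2f}")
```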

📚 Citation

If you use REval, please consider citing the following paper:

@inproceedings{alt-etal-2020-probing,
    title={Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction},
    author={Christoph Alt and Aleksandra Gabryszak and Leonhard Hennig},
    year={2020},
    booktitle={Proceedings of ACL},
    url={https://arxiv.org/abs/2004.08134}
}

📘 License

REval is released under the terms of the MIT License.
