Code for Generating Disentangled Arguments with Prompts: A Simple Event Extraction Framework that Works

Last update: Oct 29, 2022

Overview

GDAP

Code for Generating Disentangled Arguments with Prompts: A Simple Event Extraction Framework that Works

Environment

Python (verified: v3.8)
CUDA (verified: v11.1)
Packages (see requirements.txt)

Usage

Preprocessing

We follow dygiepp for data preprocessing.

text2et: Event Type Detection
ettext2tri: Trigger Extraction
etrttext2role: Argument Extraction

# data processed by dyieapp
data/text2target/dyiepp_ace1005_ettext2tri_subtype
├── event.schema 
├── test.json
├── train.json
└── val.json

# data processed by  data_convert.convert_text_to_target
data/text2target/dyiepp_ace1005_ettext2tri_subtype
├── event.schema
├── test.json
├── train.json
└── val.json

Useful commands:

python -m data_convert.convert_text_to_target # data/raw_data -> data/text2target
python convert_dyiepp_to_sentence.py data/raw_data/dyiepp_ace2005 # doc -> sentence, used in evaluation

Training

Relevant scripts:

run_seq2seq.py: Python code entry, modified from the transformers/examples/seq2seq/run_seq2seq.py
run_seq2seq_span.bash: Model training script logging to the log file.

Example (see the above two files for more details):

# ace05 event type detection t5-base, the metric_format use eval_trigger-F1 
bash run_seq2seq_span.bash --data=dyiepp_ace2005_text2et_subtype --model=t5-base --format=et --metric_format=eval_trigger-F1

# ace05 tri extraction t5-base
bash run_seq2seq_span.bash --data=dyiepp_ace2005_ettext2tri_subtype --model=t5-base --format=tri --metric_format=eval_trigger-F1

# ace05 argument extraction t5-base
bash run_seq2seq_span.bash --data=dyiepp_ace2005_etrttext2role_subtype --model=t5-base --format=role --metric_format=eval_role-F1

Trained models are saved in the models/ folder.

Evaluation

run_tri_predict.bash: trigger extraction evaluation and inference script.
run_arg_predict.bash: argument extraction evaluation and inference script.

Todo

We aim to expand the codebase for a wider range of tasks, including

Name Entity Recognition
Keyword Generation
Event Relation Identification

If you find this repo helpful...

Please give us a ⭐ and cite our paper as

@misc{si2021-GDAP,
      title={Generating Disentangled Arguments with Prompts: A Simple Event Extraction Framework that Works}, 
      author={Jinghui Si and Xutan Peng and Chen Li and Haotian Xu and Jianxin Li},
      year={2021},
      eprint={2110.04525},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

This project borrows code from Text2Event

Code for Generating Disentangled Arguments with Prompts: A Simple Event Extraction Framework that Works

Related tags

Overview

GDAP

Environment

Usage

Preprocessing

Training

Evaluation

Todo

If you find this repo helpful...

Owner

PyTorch implementation of MuseMorphose, a Transformer-based model for music style transfer.

Tree-based Search Graph for Approximate Nearest Neighbor Search

Uses OpenCV and Python Code to detect a face on the screen

MvtecAD unsupervised Anomaly Detection

Scaling and Benchmarking Self-Supervised Visual Representation Learning

Unsupervised Image Generation with Infinite Generative Adversarial Networks

Code for our NeurIPS 2021 paper: Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains

Train emoji embeddings based on emoji descriptions.

Clinica is a software platform for clinical research studies involving patients with neurological and psychiatric diseases and the acquisition of multimodal data

Supercharging Imbalanced Data Learning WithCausal Representation Transfer

Official Pytorch implementation of the paper "Action-Conditioned 3D Human Motion Synthesis with Transformer VAE", ICCV 2021

Efficient Lottery Ticket Finding: Less Data is More

MapReader: A computer vision pipeline for the semantic exploration of maps at scale

PyTorch implementation of adversarial patch

Stratified Transformer for 3D Point Cloud Segmentation (CVPR 2022)

A PyTorch implementation of "Graph Classification Using Structural Attention" (KDD 2018).

Rasterize with the least efforts for researchers.

Applicator Kit for Modo allow you to apply Apple ARKit Face Tracking data from your iPhone or iPad to your characters in Modo.

Official project repository for 'Normality-Calibrated Autoencoder for Unsupervised Anomaly Detection on Data Contamination'

[WWW 2021] Source code for "Graph Contrastive Learning with Adaptive Augmentation"