The repo contains the code to train and evaluate a system which extracts relations and explanations from dialogue.

Last update: Mar 31, 2022

Related tags

Deep Learning D-REX

Overview

The repo contains the code to train and evaluate a system which extracts relations and explanations from dialogue.

How do I cite D-REX?

For now, cite the Arxiv paper

@article{albalak2021drex,
      title={D-REX: Dialogue Relation Extraction with Explanations}, 
      author={Alon Albalak and Varun Embar and Yi-Lin Tuan and Lise Getoor and William Yang Wang},
      journal={arXiv preprint arXiv:2109.05126},
      year={2021},
}

To train the full system:

GPU=0
bash train_drex_system.sh $GPU

Notes:

The training script is set up to work with an NVIDIA Titan RTX (24Gb memory, mixed-precision)
To train on a GPU with less memory, adjust the GPU_BATCH_SIZE parameter in train_drex_system.sh to match your memory limit.
Training the full system takes ~24 hours on a single NVIDIA Titan RTX

To test the trained system:

GPU=0
bash test_drex_system.sh $GPU

To train/test individual modules:

Relation Extraction Model -

Training:

GPU=0
MODEL_PATH=relation_extraction_model
mkdir $MODEL_PATH
CUDA_VISIBLE_DEVICES=$GPU python3 train_relation_extraction_model.py \
    --model_class=relation_extraction_roberta \
    --model_name_or_path=roberta-base \
    --base_model=roberta-base \
    --effective_batch_size=30 \
    --gpu_batch_size=30 \
    --fp16 \
    --output_dir=$MODEL_PATH \
    --relation_extraction_pretraining \
    > $MODEL_PATH/train_outputs.log

Testing:

GPU=0
MODEL_PATH=relation_extraction_model
BEST_MODEL=$(ls $MODEL_PATH/F1* -d | sort -r | head -n 1)
THRESHOLD1=$(echo $BEST_MODEL | grep -o "T1.....")
THRESHOLD1=${THRESHOLD1: -2}
THRESHOLD2=$(echo $BEST_MODEL | grep -o "T2.....")
THRESHOLD2=${THRESHOLD2: -2}
CUDA_VISIBLE_DEVICES=0 python3 test_relation_extraction_model.py \
    --model_class=relation_extraction_roberta \
    --model_name_or_path=$BEST_MODEL \
    --base_model=roberta-base \
    --relation_extraction_pretraining \
    --threshold1=$THRESHOLD1 \
    --threshold2=$THRESHOLD2 \
    --data_split=test

Explanation Extraction Model -

Training:

GPU=0
MODEL_PATH=explanation_extraction_model
mkdir $MODEL_PATH
CUDA_VISIBLE_DEVICES=$GPU python3 train_explanation_policy.py \
    --model_class=explanation_policy_roberta \
    --model_name_or_path=roberta-base \
    --base_model=roberta-base \
    --effective_batch_size=30 \
    --gpu_batch_size=30 \
    --fp16 \
    --output_dir=$MODEL_PATH \
    --explanation_policy_pretraining \
    > $MODEL_PATH/train_outputs.log

Testing:

GPU=0
MODEL_PATH=explanation_extraction_model
BEST_MODEL=$(ls $MODEL_PATH/F1* -d | sort -r | head -n 1)
CUDA_VISIBLE_DEVICES=$GPU python3 test_explanation_policy.py \
    --model_class=explanation_policy_roberta \
    --model_name_or_path=$BEST_MODEL \
    --base_model=roberta-base \
    --explanation_policy_pretraining \
    --data_split=test

The repo contains the code to train and evaluate a system which extracts relations and explanations from dialogue.

Related tags

Overview

The repo contains the code to train and evaluate a system which extracts relations and explanations from dialogue.

How do I cite D-REX?

To train the full system:

To test the trained system:

To train/test individual modules:

Owner

Alon Albalak

SOTA model in CIFAR10

A Multi-modal Perception Tracker (MPT) for speaker tracking using both audio and visual modalities

Numerai tournament example scripts using NN and optuna

NeuroMorph: Unsupervised Shape Interpolation and Correspondence in One Go

Implementation of Kronecker Attention in Pytorch

WSDM2022 "A Simple but Effective Bidirectional Extraction Framework for Relational Triple Extraction"

ICON: Implicit Clothed humans Obtained from Normals

ONNX Runtime Web demo is an interactive demo portal showing real use cases running ONNX Runtime Web in VueJS.

Implementation of "With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition, BMVC, 2021" in PyTorch

Multi-Template Mouse Brain MRI Atlas (MBMA): both in-vivo and ex-vivo

Implementation of SE3-Transformers for Equivariant Self-Attention, in Pytorch.

Deploy optimized transformer based models on Nvidia Triton server

P-Tuning v2: Prompt Tuning Can Be Comparable to Finetuning Universally Across Scales and Tasks

Unofficial PyTorch implementation of TokenLearner by Google AI

Real-Time SLAM for Monocular, Stereo and RGB-D Cameras, with Loop Detection and Relocalization Capabilities

This repository collects project-relevant Isabelle/HOL formalizations.

Large dataset storage format for Pytorch

The codes I made while I practiced various TensorFlow examples

Face Transformer for Recognition

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.