Code for our paper "Graph Pre-training for AMR Parsing and Generation" in ACL2022

Last update: Jan 03, 2023

Overview

AMRBART

An implementation for ACL2022 paper "Graph Pre-training for AMR Parsing and Generation". You may find our paper here (Arxiv).

Requirements

python 3.8
pytorch 1.8
transformers 4.8.2
pytorch-lightning 1.5.0
Tesla V100 or A100

We recommend to use conda to manage virtual environments:

conda env update --name <env> --file requirements.yml

We also provide a docker image here.

Data Processing

You may download the AMR corpora at LDC.

We follow Spring to preprocess AMR graphs:

# 1. install spring 
cd spring && pip install -e .
# 2. processing data
bash run-preprocess.sh

Pre-training

bash run-posttrain-bart-textinf-joint-denoising-6task-large-unified-V100.sh /path/to/BART/

Fine-tuning

For AMR Parsing, run

bash finetune_AMRbart_amrparsing.sh /path/to/pre-trained/AMRBART/ gpu_id

For AMR-to-text Generation, run

bash finetune_AMRbart_amr2text.sh /path/to/pre-trained/AMRBART/ gpu_id

Evaluation

For AMR Parsing, run

bash eval_AMRbart_amrparsing.sh /path/to/fine-tuned/AMRBART/ gpu_id

For AMR-to-text Generation, run

bash eval_AMRbart_amr2text.sh /path/to/fine-tuned/AMRBART/ gpu_id

Inference on your own data

If you want to run our code on your own data, try to transform your data into the format here, then run

For AMR Parsing, run

bash inference_amr.sh /path/to/fine-tuned/AMRBART/ gpu_id

For AMR-to-text Generation, run

bash inference_text.sh /path/to/fine-tuned/AMRBART/ gpu_id

Pre-trained Models

Pre-trained AMRBART

Setting	Params	checkpoint
AMRBART-base	142M	model
AMRBART-large	409M	model

Fine-tuned models on AMR-to-Text Generation

Setting	BLEU(tok)	BLEU(detok)	checkpoint	output
AMRBART-large (AMR2.0)	49.8	45.7	model	output
AMRBART-large (AMR3.0)	49.2	45.0	model	output

To get the tokenized bleu score, you need to use the scorer we provide here. We use this script in order to ensure comparability with previous approaches.

Fine-tuned models on AMR Parsing

Setting	Smatch	checkpoint	output
AMRBART-large (AMR2.0)	85.4	model	output
AMRBART-large (AMR3.0)	84.2	model	output

Todo

clean code

References

@inproceedings{bai-etal-2022-graph,
    title = "Graph Pre-training for {AMR} Parsing and Generation",
    author = "Bai, Xuefeng  and
      Chen, Yulong and
      Zhang, Yue",
    booktitle = "Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = may,
    year = "2022",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "todo",
    doi = "todo",
    pages = "todo"
}

Code for our paper "Graph Pre-training for AMR Parsing and Generation" in ACL2022

Related tags

Overview

AMRBART

Requirements

Data Processing

Pre-training

Fine-tuning

Evaluation

Inference on your own data

Pre-trained Models

Pre-trained AMRBART

Fine-tuned models on AMR-to-Text Generation

Fine-tuned models on AMR Parsing

Todo

References

Owner

xfbai

Code for DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning

Official code for "Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer. ICCV2021".

Official implementation of "Learning Forward Dynamics Model and Informed Trajectory Sampler for Safe Quadruped Navigation" (RSS 2022)

The coda and data for "Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach" (ACL '21)

Think Big, Teach Small: Do Language Models Distil Occam’s Razor?

Use AI to generate a optimized stock portfolio

Offcial implementation of "A Hybrid Video Anomaly Detection Framework via Memory-Augmented Flow Reconstruction and Flow-Guided Frame Prediction, ICCV-2021".

HGCAE Pytorch implementation. CVPR2021 accepted.

Python scripts form performing stereo depth estimation using the HITNET model in Tensorflow Lite.

Julia package for contraction of tensor networks, based on the sweep line algorithm outlined in the paper General tensor network decoding of 2D Pauli codes

🌾 PASTIS 🌾 Panoptic Agricultural Satellite TIme Series

Bare bones use-case for deploying a containerized web app (built in streamlit) on AWS.

Temporal Segment Networks (TSN) in PyTorch

[ICML 2020] "When Does Self-Supervision Help Graph Convolutional Networks?" by Yuning You, Tianlong Chen, Zhangyang Wang, Yang Shen

Official implementation of Pixel-Level Bijective Matching for Video Object Segmentation

codebase for "A Theory of the Inductive Bias and Generalization of Kernel Regression and Wide Neural Networks"

AMTML-KD: Adaptive Multi-teacher Multi-level Knowledge Distillation

Making a music video with Wav2CLIP and VQGAN-CLIP

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

Various operations like path tracking, counting, etc by using yolov5