Code for our paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization", ACL 2021

Last update: Dec 12, 2022

Related tags

Deep Learning SimCLS

Overview

SimCLS

Code for our paper: "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization", ACL 2021

1. How to Install

Requirements

python3
conda create --name env --file spec-file.txt
pip3 install -r requirements.txt

Description of Codes

main.py -> training and evaluation procedure
model.py -> models
data_utils.py -> dataloader
utils.py -> utility functions
preprocess.py -> data preprocessing

Workspace

Following directories should be created for our experiments.

./cache -> storing model checkpoints
./result -> storing evaluation results

2. Preprocessing

We use the following datasets for our experiments.

CNN/DailyMail -> https://github.com/abisee/cnn-dailymail
XSum -> https://github.com/EdinburghNLP/XSum

For data preprocessing, please run

python preprocess.py --src_dir [path of the raw data] --tgt_dir [output path] --split [train/val/test] --cand_num [number of candidate summaries]

src_dir should contain the following files (using test split as an example):

test.source
test.source.tokenized
test.target
test.target.tokenized
test.out
test.out.tokenized

Each line of these files should contain a sample. In particular, you should put the candidate summaries for one data sample at neighboring lines in test.out and test.out.tokenized.

The preprocessing precedure will store the processed data as seperate json files in tgt_dir.

We have provided an example file in ./example.

3. How to Run

Hyper-parameter Setting

You may specify the hyper-parameters in main.py.

Train

python main.py --cuda --gpuid [list of gpuid] -l

Fine-tune

python main.py --cuda --gpuid [list of gpuid] -l --model_pt [model path]

Evaluate

python main.py --cuda --gpuid [single gpu] -e --model_pt [model path]

4. Results

CNNDM

	ROUGE-1	ROUGE-2	ROUGE-L
BART	44.39	21.21	41.28
Ours	46.67	22.15	43.54

XSum

	ROUGE-1	ROUGE-2	ROUGE-L
Pegasus	47.10	24.53	39.23
Ours	47.61	24.57	39.44

Our model outputs on these datasets can be found in ./output.

Code for our paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization", ACL 2021

Related tags

Overview

SimCLS

1. How to Install

Requirements

Description of Codes

Workspace

2. Preprocessing

3. How to Run

Hyper-parameter Setting

Train

Fine-tune

Evaluate

4. Results

CNNDM

XSum

Owner

Yixin Liu

Sharing of contents on mitochondrial encounter networks

AMTML-KD: Adaptive Multi-teacher Multi-level Knowledge Distillation

Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"

Federated Deep Reinforcement Learning for the Distributed Control of NextG Wireless Networks.

Predict and time series avocado hass

Direct design of biquad filter cascades with deep learning by sampling random polynomials.

Weakly Supervised Segmentation by Tensorflow.

Official implementation of MSR-GCN (ICCV 2021 paper)

Object detection GUI based on PaddleDetection

This repository contains the code for using the H3DS dataset introduced in H3D-Net: Few-Shot High-Fidelity 3D Head Reconstruction

Localized representation learning from Vision and Text (LoVT)

DEEPAGÉ: Answering Questions in Portuguese about the Brazilian Environment

SporeAgent: Reinforced Scene-level Plausibility for Object Pose Refinement

Per-Pixel Classification is Not All You Need for Semantic Segmentation

Elevation Mapping on GPU.

[ICML 2020] "When Does Self-Supervision Help Graph Convolutional Networks?" by Yuning You, Tianlong Chen, Zhangyang Wang, Yang Shen

A `Neural = Symbolic` framework for sound and complete weighted real-value logic

Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness

2.86% and 15.85% on CIFAR-10 and CIFAR-100

Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).