Source code for CAST - Crisis Domain Adaptation Using Sequence-to-sequence Transformers (Accepted to ISCRAM 2021, CorePaper).

Last update: Jul 14, 2021

Related tags

Deep Learning CAST

Overview

Source code for CAST: Crisis Domain Adaptation UsingSequence-to-sequenceTransformers (Paper, BibTeX, Accepted to ISCRAM 2021, CorePaper)

Quick start

Download the code

git clone https://github.com/wangcongcong123/CAST.git
cd CAST

Download the dataset from here, and extract the data to data/ dir (create it first if not exists).

Model training and testing for crisis domain adaptation:

# go to your python env
# install dependencies
pip install -r requirements.txt

# training and testing at one go

# here we run CAST on crisis_t6 as an example

python train_t6.py

# In train_t6.py, for quick configuration:

train_event_names => the source event(s)
test_event_name => the target event
data_config => postfix template: 't2t' (postQ) or 'normal' (standard) as described in the paper
model_select => the base seq2seq model: 't5-small' or 't5-base'

# For other configuration, just go for a bit hacking so should be easy.

# For nepal_queensland, similary run `python train_nepal_queensland.py`, go check and configure the script to reproduce the paper's results.

Cite

If you find this helpful for your work, consider to cite it as follows please:

@inproceedings{Wang2021a,
 title = {Crisis {{Domain Adaptation Using Sequence}}-to-Sequence {{Transformers}}},
 booktitle = {{{ISCRAM}} 2021 {{Conference Proceedings}} - 18th {{International Conference}} on {{Information Systems}} for {{Crisis Response}} and {{Management}}},
 author = {Wang, Congcong and Nulty, Paul and Lillis, David},
 year = {2021},
 month = {May},
 address = {{Blacksburg, VA, USA}},
 abstract = {User-generated content (UGC) on social media can act as a key source of information for emergency responders in crisis situations. However, due to the volume concerned, computational techniques are needed to effectively filter and prioritise this content as it arises during emerging events. In the literature, these techniques are trained using annotated content from previous crises. In this paper, we investigate how this prior knowledge can be best leveraged for new crises by examining the extent to which crisis events of a similar type are more suitable for adaptation to new events (cross-domain adaptation). Given the recent successes of transformers in various language processing tasks, we propose CAST: an approach for Crisis domain Adaptation leveraging Sequence-to-sequence Transformers. We evaluate CAST using two major crisis-related message classification datasets. Our experiments show that our CAST-based best run without using any target data achieves the state of the art performance in both in-domain and cross-domain contexts. Moreover, CAST is particularly effective in one-to-one cross-domain adaptation when trained with a larger language model. In many-to-one adaptation where multiple crises are jointly used as the source domain, CAST further improves its performance. In addition, we find that more similar events are more likely to bring better adaptation performance whereas fine-tuning using dissimilar events does not help for adaptation. To aid reproducibility, we open source our code to the community.},
}

Source code for CAST - Crisis Domain Adaptation Using Sequence-to-sequence Transformers (Accepted to ISCRAM 2021, CorePaper).

Related tags

Overview

Source code for CAST: Crisis Domain Adaptation UsingSequence-to-sequenceTransformers (Paper, BibTeX, Accepted to ISCRAM 2021, CorePaper)

Quick start

Cite

Owner

Congcong Wang

Full Transformer Framework for Robust Point Cloud Registration with Deep Information Interaction

The code succinctly shows how our ensemble learning based on deep learning CNN is used for LAM-avulsion-diagnosis.

NNR conformation conditional and global probabilities estimation and analysis in peptides or proteins fragments

OBBDetection is a oriented object detection library, which is based on MMdetection.

FinRL-Meta: A Universe for Data-Driven Financial Reinforcement Learning. 🔥

Ray tracing of a Schwarzschild black hole written entirely in TensorFlow.

Download and preprocess popular sequential recommendation datasets

Home repository for the Regularized Greedy Forest (RGF) library. It includes original implementation from the paper and multithreaded one written in C++, along with various language-specific wrappers.

The official implementation of the CVPR 2021 paper FAPIS: a Few-shot Anchor-free Part-based Instance Segmenter

CT-Net: Channel Tensorization Network for Video Classification

Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning, NeurIPS 2021 (Spotlight)

TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning

This is the implementation of the paper LiST: Lite Self-training Makes Efficient Few-shot Learners.

This code is a near-infrared spectrum modeling method based on PCA and pls

Adversarial Autoencoders

Code for GNMR in ICDE 2021

Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"

Python wrapper class for OpenVINO Model Server. User can submit inference request to OVMS with just a few lines of code

🥇 LG-AI-Challenge 2022 1위 솔루션 입니다.

A PyTorch implementation of Multi-digit Number Recognition from Street View Imagery using Deep Convolutional Neural Networks

Source code for CAST - Crisis Domain Adaptation Using Sequence-to-sequence Transformers (Accepted to ISCRAM 2021, CorePaper).

Related tags

Overview

Source code for CAST: Crisis Domain Adaptation UsingSequence-to-sequenceTransformers (Paper, BibTeX, Accepted to ISCRAM 2021, CorePaper)

Quick start

Cite

Owner

Congcong Wang

Full Transformer Framework for Robust Point Cloud Registration with Deep Information Interaction

The code succinctly shows how our ensemble learning based on deep learning CNN is used for LAM-avulsion-diagnosis.

NNR conformation conditional and global probabilities estimation and analysis in peptides or proteins fragments

OBBDetection is a oriented object detection library, which is based on MMdetection.

FinRL­-Meta: A Universe for Data­-Driven Financial Reinforcement Learning. 🔥

Ray tracing of a Schwarzschild black hole written entirely in TensorFlow.

Download and preprocess popular sequential recommendation datasets

Home repository for the Regularized Greedy Forest (RGF) library. It includes original implementation from the paper and multithreaded one written in C++, along with various language-specific wrappers.

The official implementation of the CVPR 2021 paper FAPIS: a Few-shot Anchor-free Part-based Instance Segmenter

CT-Net: Channel Tensorization Network for Video Classification

Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning, NeurIPS 2021 (Spotlight)

TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning

This is the implementation of the paper LiST: Lite Self-training Makes Efficient Few-shot Learners.

This code is a near-infrared spectrum modeling method based on PCA and pls

Adversarial Autoencoders

Code for GNMR in ICDE 2021

Implementation of a memory efficient multi-head attention as proposed in the paper, "Self-attention Does Not Need O(n²) Memory"

Python wrapper class for OpenVINO Model Server. User can submit inference request to OVMS with just a few lines of code

🥇 LG-AI-Challenge 2022 1위 솔루션 입니다.

A PyTorch implementation of Multi-digit Number Recognition from Street View Imagery using Deep Convolutional Neural Networks

FinRL-Meta: A Universe for Data-Driven Financial Reinforcement Learning. 🔥