Code for layerwise detection of linguistic anomaly paper (ACL 2021)

Last update: Dec 07, 2022

Related tags

Overview

Layerwise Anomaly

This repository contains the source code and data for our ACL 2021 paper: "How is BERT surprised? Layerwise detection of linguistic anomalies" by Bai Li, Zining Zhu, Guillaume Thomas, Yang Xu, and Frank Rudzicz.

Citation

If you use our work in your research, please cite:

Li, B., Zhu, Z., Thomas, G., Xu, Y., and Rudzicz, F. (2021) How is BERT surprised? Layerwise detection of linguistic anomalies. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL).

@inproceedings{li2021layerwise,
  author = "Li, Bai and Zhu, Zining and Thomas, Guillaume and Xu, Yang and Rudzicz, Frank",
  title = "How is BERT surprised? Layerwise detection of linguistic anomalies",
  booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL)",
  publisher = "Association for Computational Linguistics",
  year = "2021",
}

Dependencies

The project was developed with the following library versions. Running with other versions may crash or produce incorrect results.

Python 3.7.5
CUDA Version: 11.0
torch==1.7.1
transformers==4.5.1
numpy==1.19.0
pandas==0.25.3
scikit-learn==0.22

Setup Instructions

Clone this repo: git clone https://github.com/SPOClab-ca/layerwise-anomaly
Download BNC Baby (4m word sample) from this link and extract into data/bnc/
Run BNC preprocessing script: python scripts/process_bnc.py --bnc_dir=data/bnc/download/Texts --to=data/bnc.pkl
Clone BLiMP repo: cd data && git clone https://github.com/alexwarstadt/blimp

GMM experiments on BLiMP (Figure 2 and Appendix A)

PYTHONPATH=. time python scripts/blimp_anomaly.py \
  --bnc_path=data/bnc.pkl \
  --blimp_path=data/blimp/data/ \
  --out=blimp_result

Frequency correlation (Figure 3 and Appendix B)

Run the notebooks/FreqSurprisal.ipynb notebook.

Surprisal gap experiments (Figure 4)

PYTHONPATH=. time python scripts/run_surprisal_gaps.py \
  --bnc_path=data/bnc.pkl \
  --out=surprisal_gaps

Accuracy scores (Table 2)

PYTHONPATH=. time python scripts/run_accuracy.py \
  --model_name=roberta-base \
  --anomaly_model=gmm

Run unit tests

PYTHONPATH=. pytest tests

Code for layerwise detection of linguistic anomaly paper (ACL 2021)

Related tags

Overview

Layerwise Anomaly

Citation

Dependencies

Setup Instructions

GMM experiments on BLiMP (Figure 2 and Appendix A)

Frequency correlation (Figure 3 and Appendix B)

Surprisal gap experiments (Figure 4)

Accuracy scores (Table 2)

Run unit tests

Owner

ICLR 2021 i-Mix: A Domain-Agnostic Strategy for Contrastive Representation Learning

eXPeditious Data Transfer

Think Big, Teach Small: Do Language Models Distil Occam’s Razor?

Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning

GPU-accelerated PyTorch implementation of Zero-shot User Intent Detection via Capsule Neural Networks

Management Dashboard for Torchserve

The code from the paper Character Transformations for Non-Autoregressive GEC Tagging

PyTorch implementation of ICLR 2022 paper PiCO: Contrastive Label Disambiguation for Partial Label Learning

CS50's Introduction to Artificial Intelligence Test Scripts

You Only Sample (Almost) Once: Linear Cost Self-Attention Via Bernoulli Sampling

Code and data for paper "Deep Photo Style Transfer"

[ACM MM 2021] Diverse Image Inpainting with Bidirectional and Autoregressive Transformers

Exploring Simple Siamese Representation Learning

NAS Benchmark in "Prioritized Architecture Sampling with Monto-Carlo Tree Search", CVPR2021

This repository contains code from the paper "TTS-GAN: A Transformer-based Time-Series Generative Adversarial Network"

Grammar Induction using a Template Tree Approach

📚 A collection of all the Deep Learning Metrics that I came across which are not accuracy/loss.

FPGA: Fast Patch-Free Global Learning Framework for Fully End-to-End Hyperspectral Image Classification

Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization

Similarity-based Gray-box Adversarial Attack Against Deep Face Recognition