Extracting knowledge graphs from language models as a diagnostic benchmark of model performance.

Last update: Oct 25, 2022

Overview

Interpreting Language Models Through Knowledge Graph Extraction

Idea: How do we interpret what a language model learns at various stages of training? Language models have been recently described as open knowledge bases. We can generate knowledge graphs by extracting relation triples from masked language models at sequential epochs or architecture variants to examine the knowledge acquisition process.

Dataset: Squad, Google-RE (3 flavors)

Models: BERT, RoBeRTa, DistilBert, training RoBERTa from scratch

Authors: Vinitra Swamy, Angelika Romanou, Martin Jaggi

This repository is the official implementation of the NeurIPS 2021 XAI4Debugging paper titled "Interpreting Language Models Through Knowledge Graph Extraction". Found this work useful? Please cite our paper.

Quick Start Guide

Pretrained Model (BERT, DistilBERT, RoBERTa) -> Knowlege Graph

Install requirements and clone repository

git clone https://github.com/epfml/interpret-lm-knowledge.git
pip install git+https://github.com/huggingface/transformers   
pip install textacy
cd interpret-lm-knowledge/scripts

Generate knowledge graphs and dataframes python run_knowledge_graph_experiments.py <dataset> <model> <use_spacy>
e.g. squad Bert spacy
e.g. re-place-birth Roberta

options:

dataset=squad - "squad", "re-place-birth", "re-date-birth", "re-place-death"  
model=Roberta - "Bert", "Roberta", "DistilBert"  
extractor=spacy - "spacy", "textacy", "custom"

See run_lm_experiments notebook for examples.

Train LM model from scratch -> Knowledge Graph

Install requirements and clone repository

!pip install git+https://github.com/huggingface/transformers
!pip list | grep -E 'transformers|tokenizers'
!pip install textacy

Run wikipedia_train_from_scratch_lm.ipynb.
As included in the last cell of the notebook, you can run the KG generation experiments by:

from run_training_kg_experiments import *
run_experiments(tokenizer, model, unmasker, "Roberta3e")

Citations

@inproceedings{swamy2021interpreting,
 author = {Swamy, Vinitra and Romanou, Angelika and Jaggi, Martin},
 booktitle = {Advances in Neural Information Processing Systems, Workshop on eXplainable AI Approaches for Debugging and Diagnosis},
 title = {Interpreting Language Models Through Knowledge Graph Extraction},
 volume = {35},
 year = {2021}
}

Extracting knowledge graphs from language models as a diagnostic benchmark of model performance.

Related tags

Overview

Interpreting Language Models Through Knowledge Graph Extraction

Quick Start Guide

Pretrained Model (BERT, DistilBERT, RoBERTa) -> Knowlege Graph

Train LM model from scratch -> Knowledge Graph

Citations

Owner

EPFL Machine Learning and Optimization Laboratory

Code For TDEER: An Efficient Translating Decoding Schema for Joint Extraction of Entities and Relations (EMNLP2021)

[AAAI22] Reliable Propagation-Correction Modulation for Video Object Segmentation

Fast Scattering Transform with CuPy/PyTorch

Code for intrusion detection system (IDS) development using CNN models and transfer learning

Mesh TensorFlow: Model Parallelism Made Easier

GLIP: Grounded Language-Image Pre-training

Unofficial Implementation of RobustSTL: A Robust Seasonal-Trend Decomposition Algorithm for Long Time Series (AAAI 2019)

Using deep learning to predict gene structures of the coding genes in DNA sequences of Arabidopsis thaliana

A crossplatform menu bar application using mpv as DLNA Media Renderer.

Prompt Tuning with Rules

Official implementation of Rethinking Graph Neural Architecture Search from Message-passing (CVPR2021)

Cmsc11 arcade - Final Project for CMSC11

Repo for Photon-Starved Scene Inference using Single Photon Cameras, ICCV 2021

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

A machine learning benchmark of in-the-wild distribution shifts, with data loaders, evaluators, and default models.

Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis

Robotics with GPU computing

[ICCV'2021] Image Inpainting via Conditional Texture and Structure Dual Generation

Efficiently Disentangle Causal Representations