Source code for "Understanding Knowledge Integration in Language Models with Graph Convolutions"

Last update: Oct 18, 2022

Related tags

Deep Learning GCS_KI

Overview

Graph Convolution Simulator (GCS)

Source code for "Understanding Knowledge Integration in Language Models with Graph Convolutions"

Requirements:

PyTorch and DGL should be installed based on your system. For other libraries, you can install them using the following command:

$ pip install -r requirements.txt

Run Knowledge Integration Interpretation (KI) by GCS on example data:

$ bash run_example.sh

Interpretation results are saved in ./example/example_data/gcs.edgelist.

If the knowledge graph is small, users can visualize it by ./example/example_data/results.pdf. Here is the results for the example data:

Run Knowledge Intergration Interpretation by GCS for your own model

Step 1: Prepare the entity embedding of vanilla LM and knowledge-enhanced LM:

Store them as PyTorch tensor (.pt) format. Make sure they have the same number of rows, and the indexes of entities are the same. The default files are emb_roberta.pt and emb_kadapter.pt.

Step 2: Prepare the knowledge graph:

Three files are needed to load the knowledge graph:

a) qid2idx.json: The index dictionary. The key is entity Q-label, and value is the index of entity in entity embedding
b) qid2label.json : The label dictionary. The key is entity Q-label, and the value is the entity label text. Note that this dictionary is only for visualization, you can set it as {Q-label: Q-label} if you don't have the text.
c) kg.edgelist: The knowledge triple to construct knowledge graph. Each row is for one triple as: entity1_idx \t entity2_idx \t {}.

Step 3: Run GCS for KI interpretation:

After two preparation steps, you can run GCS by:

$ python src/example.py  --emb_vlm emb_roberta.pt  -emb_klm emb_kadapter.pt  --data_dir ./example_data  --lr 1e-3  --loss mi_loss

As for the hyperparameters, users may check them in ./example/src/example.py. Note that for large knowledge graphs, we recommend to use mutual information loss (mi_loss), and please do not visualize the results for large knowledge graphs.

Step 4: Analyze GCS interpretation results:

The interpretation results are saved in ./example/example_data/gcs.edgelist. Each row is for one triple as: entity1_idx \t entity2_idx \t {'a': xxxx}. Here, the value of 'a' is the attention coefficient value on the triple/entity (entity1, r, entity2). Users may use them to analyze the factual knowledge learned during knowledge integration.

Reproduce the results in the paper

Please enter ./all_exp folder for more details

Cite

If you use the code, please cite the paper:

@article{hou2022understanding,
  title={Understanding Knowledge Integration in Language Models with Graph Convolutions},
  author={Hou, Yifan and Fu, Guoji and Sachan, Mrinmaya},
  journal={arXiv preprint arXiv:2202.00964},
  year={2022}
}

Contact

Feel free to open an issue or send me ([email protected]) an email if you have any questions!

Source code for "Understanding Knowledge Integration in Language Models with Graph Convolutions"

Related tags

Overview

Graph Convolution Simulator (GCS)

Requirements:

Run Knowledge Integration Interpretation (KI) by GCS on example data:

Run Knowledge Intergration Interpretation by GCS for your own model

Step 1: Prepare the entity embedding of vanilla LM and knowledge-enhanced LM:

Step 2: Prepare the knowledge graph:

Step 3: Run GCS for KI interpretation:

Step 4: Analyze GCS interpretation results:

Reproduce the results in the paper

Cite

Contact

Owner

yifan

The official codes of our CVPR2022 paper: A Differentiable Two-stage Alignment Scheme for Burst Image Reconstruction with Large Shift

Implementation of Neonatal Seizure Detection using EEG signals for deploying on edge devices including Raspberry Pi.

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

An open software package to develop BCI based brain and cognitive computing technology for recognizing user's intention using deep learning

PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch

Implementation of character based convolutional neural network

A basic neural network for image segmentation.

Deep learning with dynamic computation graphs in TensorFlow

Cross Quality LFW: A database for Analyzing Cross-Resolution Image Face Recognition in Unconstrained Environments

Evaluating AlexNet features at various depths

An official implementation of "Background-Aware Pooling and Noise-Aware Loss for Weakly-Supervised Semantic Segmentation" (CVPR 2021) in PyTorch.

Temporally Efficient Vision Transformer for Video Instance Segmentation, CVPR 2022, Oral

Fuzzification helps developers protect the released, binary-only software from attackers who are capable of applying state-of-the-art fuzzing techniques

Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite

LVI-SAM: Tightly-coupled Lidar-Visual-Inertial Odometry via Smoothing and Mapping

DiAne is a smart fuzzer for IoT devices

A scanpy extension to analyse single-cell TCR and BCR data.

Discovering Dynamic Salient Regions with Spatio-Temporal Graph Neural Networks

TrTr: Visual Tracking with Transformer

Code of the paper "Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition"