Source code for "Understanding Knowledge Integration in Language Models with Graph Convolutions"

Last update: Oct 18, 2022

Related tags

Deep Learning GCS_KI

Overview

Graph Convolution Simulator (GCS)

Source code for "Understanding Knowledge Integration in Language Models with Graph Convolutions"

Requirements:

PyTorch and DGL should be installed based on your system. For other libraries, you can install them using the following command:

$ pip install -r requirements.txt

Run Knowledge Integration Interpretation (KI) by GCS on example data:

$ bash run_example.sh

Interpretation results are saved in ./example/example_data/gcs.edgelist.

If the knowledge graph is small, users can visualize it by ./example/example_data/results.pdf. Here is the results for the example data:

Run Knowledge Intergration Interpretation by GCS for your own model

Step 1: Prepare the entity embedding of vanilla LM and knowledge-enhanced LM:

Store them as PyTorch tensor (.pt) format. Make sure they have the same number of rows, and the indexes of entities are the same. The default files are emb_roberta.pt and emb_kadapter.pt.

Step 2: Prepare the knowledge graph:

Three files are needed to load the knowledge graph:

a) qid2idx.json: The index dictionary. The key is entity Q-label, and value is the index of entity in entity embedding
b) qid2label.json : The label dictionary. The key is entity Q-label, and the value is the entity label text. Note that this dictionary is only for visualization, you can set it as {Q-label: Q-label} if you don't have the text.
c) kg.edgelist: The knowledge triple to construct knowledge graph. Each row is for one triple as: entity1_idx \t entity2_idx \t {}.

Step 3: Run GCS for KI interpretation:

After two preparation steps, you can run GCS by:

$ python src/example.py  --emb_vlm emb_roberta.pt  -emb_klm emb_kadapter.pt  --data_dir ./example_data  --lr 1e-3  --loss mi_loss

As for the hyperparameters, users may check them in ./example/src/example.py. Note that for large knowledge graphs, we recommend to use mutual information loss (mi_loss), and please do not visualize the results for large knowledge graphs.

Step 4: Analyze GCS interpretation results:

The interpretation results are saved in ./example/example_data/gcs.edgelist. Each row is for one triple as: entity1_idx \t entity2_idx \t {'a': xxxx}. Here, the value of 'a' is the attention coefficient value on the triple/entity (entity1, r, entity2). Users may use them to analyze the factual knowledge learned during knowledge integration.

Reproduce the results in the paper

Please enter ./all_exp folder for more details

Cite

If you use the code, please cite the paper:

@article{hou2022understanding,
  title={Understanding Knowledge Integration in Language Models with Graph Convolutions},
  author={Hou, Yifan and Fu, Guoji and Sachan, Mrinmaya},
  journal={arXiv preprint arXiv:2202.00964},
  year={2022}
}

Contact

Feel free to open an issue or send me ([email protected]) an email if you have any questions!

Source code for "Understanding Knowledge Integration in Language Models with Graph Convolutions"

Related tags

Overview

Graph Convolution Simulator (GCS)

Requirements:

Run Knowledge Integration Interpretation (KI) by GCS on example data:

Run Knowledge Intergration Interpretation by GCS for your own model

Step 1: Prepare the entity embedding of vanilla LM and knowledge-enhanced LM:

Step 2: Prepare the knowledge graph:

Step 3: Run GCS for KI interpretation:

Step 4: Analyze GCS interpretation results:

Reproduce the results in the paper

Cite

Contact

Owner

yifan

Bolt Online Learning Toolbox

A Web API for automatic background removal using Deep Learning. App is made using Flask and deployed on Heroku.

Aws-machine-learning-university-accelerated-tab - Machine Learning University: Accelerated Tabular Data Class

Minimalistic PyTorch training loop

Code for classifying international patents based on the text of their titles/abstracts

Diverse Image Generation via Self-Conditioned GANs

《Single Image Reflection Removal Beyond Linearity》(CVPR 2019)

A lightweight Python-based 3D network multi-agent simulator. Uses a cell-based congestion model. Calculates risk, loudness and battery capacities of the agents. Suitable for 3D network optimization tasks.

PyDeepFakeDet is an integrated and scalable tool for Deepfake detection.

Non-Homogeneous Poisson Process Intensity Modeling and Estimation using Measure Transport

Python Wrapper for Embree

Official code implementation for "Personalized Federated Learning using Hypernetworks"

A lightweight library to compare different PyTorch implementations of the same network architecture.

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

Request execution of Galaxy SARS-CoV-2 variation analysis workflows on input data you provide.

SemiNAS: Semi-Supervised Neural Architecture Search

Orthogonal Over-Parameterized Training

Photo2cartoon - 人像卡通化探索项目 (photo-to-cartoon translation project)

[CVPR 2021] MiVOS - Mask Propagation module. Reproduced STM (and better) with training code :star2:. Semi-supervised video object segmentation evaluation.

PSGAN running with ncnn⚡妆容迁移/仿妆⚡Imitation Makeup/Makeup Transfer⚡