Code for the SIGIR 2022 paper "Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion"

Last update: Dec 28, 2022

Overview

MKGFormer


Model Architecture

Illustration of MKGformer for (a) Unified Multimodal KGC Framework and (b) Detailed M-Encoder.

Requirements

To run the code, first install the requirements:

pip install -r requirements.txt

Data Collection

The datasets that we used in our experiments are as follows:

Twitter2017

You can download the Twitter2017 dataset from this link: https://drive.google.com/file/d/1ogfbn-XEYtk9GpUECq1-IwzINnhKGJqy/view?usp=sharing

For more information regarding the dataset, please refer to the UMT repository.

MRE

The MRE dataset comes from MEGA, many thanks.

You can download the MRE dataset with detected visual objects using the following commands:
```
cd MRE
wget 120.27.214.45/Data/re/multimodal/data.tar.gz
tar -xzvf data.tar.gz
```
MKG
- FB15K-237-IMG
  
  For more information regarding the dataset, please refer to the mmkb and kg-bert repositories.
- WN18-IMG
  
  For more information regarding the dataset, please refer to the RSME repository.

The expected structure of files is:

MKGFormer
 |-- MKG	# Multimodal Knowledge Graph
 |    |-- dataset       # task data
 |    |-- data          # data process file
 |    |-- lit_models    # lightning model
 |    |-- models        # mkg model
 |    |-- scripts       # running script
 |    |-- main.py   
 |-- MNER	# Multimodal Named Entity Recognition
 |    |-- data          # task data
 |    |-- models        # mner model
 |    |-- modules       # running script
 |    |-- processor     # data process file
 |    |-- utils
 |    |-- run_mner.sh
 |    |-- run.py
 |-- MRE    # Multimodal Relation Extraction
 |    |-- data          # task data
 |    |-- models        # mre model
 |    |-- modules       # running script
 |    |-- processor     # data process file
 |    |-- run_mre.sh
 |    |-- run.py
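Before launching a run, it can help to confirm that the checkout matches the tree above. The following is a minimal sketch, not part of the repository: the helper name `missing_entries` and the `EXPECTED` table are hypothetical, and the entries are simply transcribed from the listing above.

```python
from pathlib import Path

# Expected entries per task directory, transcribed from the tree above.
# This table is an illustration, not a file shipped with the repository.
EXPECTED = {
    "MKG": ["dataset", "data", "lit_models", "models", "scripts", "main.py"],
    "MNER": ["data", "models", "modules", "processor", "utils",
             "run_mner.sh", "run.py"],
    "MRE": ["data", "models", "modules", "processor", "run_mre.sh", "run.py"],
}


def missing_entries(root):
    """Return the expected paths (relative to root) that do not exist."""
    root = Path(root)
    missing = []
    for task, entries in EXPECTED.items():
        for entry in entries:
            path = root / task / entry
            if not path.exists():
                # as_posix() keeps the output identical across platforms
                missing.append(path.relative_to(root).as_posix())
    return missing
```

Calling `missing_entries(".")` from the repository root returns an empty list when every listed file and directory is in place; otherwise it lists what still needs to be downloaded or created.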

How to run

MKG Task
- First, run Image-text Incorporated Entity Modeling to train the entity embeddings.
```
    cd MKG
    bash scripts/pretrain_fb15k-237-image.sh
```
- Then run Missing Entity Prediction.
```
    bash scripts/fb15k-237-image.sh
```
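The two MKG stages depend on each other, so the second should only start if the first succeeded. The sketch below shows one way to chain them from Python. It is an assumption-laden illustration rather than part of the repository: `run_stages` and `STAGES` are hypothetical names, and the default `cwd="MKG"` assumes the function is called from the repository root.

```python
import subprocess

# Stage scripts in the order given above; paths are relative to the MKG directory.
STAGES = [
    "scripts/pretrain_fb15k-237-image.sh",  # Image-text Incorporated Entity Modeling
    "scripts/fb15k-237-image.sh",           # Missing Entity Prediction
]


def run_stages(stages=STAGES, cwd="MKG", runner=subprocess.run):
    """Run each stage script with bash, in order, stopping at the first failure."""
    for script in stages:
        # check=True makes subprocess.run raise CalledProcessError on a
        # non-zero exit status, so a failed pretraining run aborts the chain.
        runner(["bash", script], cwd=cwd, check=True)
```

The `runner` parameter exists only so the sequencing can be exercised without launching training; in normal use, `run_stages()` with the defaults is enough.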
MNER Task

To run the MNER task, run this script:
```
cd MNER
bash run_mner.sh
```
MRE Task

To run the MRE task, run this script:
```
cd MRE
bash run_mre.sh
```

Acknowledgement

The acquisition of image data for the multimodal link prediction task follows the code from https://github.com/wangmengsd/RSME; many thanks.

Papers for the Project & How to Cite

If you use or extend our work, please cite the paper as follows:

Code for the SIGIR 2022 paper "Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion"

Owner

ZJUNLP