Code accompanying the paper "Knowledge Base Completion Meets Transfer Learning"

Overview

Knowledge Base Completion Meets Transfer Learning

This code accompanies the paper Knowledge Base Completion Meets Transfer Learning published at EMNLP 2021.

Setup

The following packages are needed to run the code:

  • Python >= 3.6
  • PyTorch >= 1.6
  • spaCy >= 2.0 and the en_core_web_sm model
  • tqdm

Run setup.sh to download and transform the GloVe embeddings and the OlpBench dataset. Note that this downloads about 3.5GB of files, which unpack into around 10GB of content.

Running the code

For full help, run python main.py -h. A couple of examples are given below:

For pre-training TuckER on OlpBench, run

python main.py -data Data/OlpBench -dim 300 -lr 1e-4 -batch 4096 -n_epochs 100 -embedding TuckER -dropout 0.3 -encoder GRU -hits [1,3,5,10,30,50] -output_dir TuckEROlpBench300 -dump_vocab -only_batch_negative

When pre-training, it is important to add -dump_vocab to store the encoder vocabulary; otherwise, the stored model cannot be loaded for fine-tuning. For any large-scale pre-training, it is also important to add the -only_batch_negative argument to avoid encoding all entities at every training step.
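
For intuition, the sketch below illustrates the difference. This is an illustrative example, not the repository's implementation, and the tensor names are assumptions: with -only_batch_negative, queries are scored only against the entities that occur in the current batch, so the cost per step does not grow with the size of the entity vocabulary.

import torch

# Illustrative sketch only (not the repository's code).
# query_emb:        (batch, dim)          encoded queries
# all_entity_emb:   (num_entities, dim)   every entity in the KB, re-encoded each step
# batch_entity_emb: (batch, dim)          only the entities appearing in the current batch

def score_against_all(query_emb, all_entity_emb):
    # Requires encoding the full entity vocabulary at every training step.
    return query_emb @ all_entity_emb.t()      # (batch, num_entities)

def score_against_batch(query_emb, batch_entity_emb):
    # In-batch negatives: cost per step is independent of the vocabulary size.
    return query_emb @ batch_entity_emb.t()    # (batch, batch)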

To fine-tune the model obtained with the above command on ReVerb20K using NoEncoder, use the command below.

python main.py -data Data/ReVerb20K -dim 300 -lr 3e-4 -batch 512 -n_epochs 500 -embedding TuckER -dropout 0.3 -encoder NoEncoder -hits [1,3,5,10,30,50] -output_dir TuckERReVerb20K -pretrained_dir TuckEROlpBench300

To train with the same setup but from a randomly-initialized model, simply remove the -pretrained_dir argument.

python main.py -data Data/ReVerb20K -dim 300 -lr 3e-4 -batch 512 -n_epochs 500 -embedding TuckER -dropout 0.3 -encoder NoEncoder -hits [1,3,5,10,30,50] -output_dir TuckERReVerb20K
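
Conceptually, -pretrained_dir amounts to initializing the fine-tuning model from saved weights instead of from random initialization. The sketch below shows the generic PyTorch pattern only; the checkpoint file name, model class, and dimensions are hypothetical and do not reflect the repository's actual code.

import os
import torch
import torch.nn as nn

# Hypothetical sketch of pretrained initialization (not the repository's code).
class TinyKBCModel(nn.Module):
    def __init__(self, num_entities, num_relations, dim=300):
        super().__init__()
        self.entity_emb = nn.Embedding(num_entities, dim)
        self.relation_emb = nn.Embedding(num_relations, dim)

def build_model(num_entities, num_relations, pretrained_dir=None):
    model = TinyKBCModel(num_entities, num_relations)
    if pretrained_dir is not None:
        # "model.pt" is a hypothetical checkpoint name; strict=False tolerates
        # dataset-specific parameters that differ between pre-training and fine-tuning.
        state = torch.load(os.path.join(pretrained_dir, "model.pt"), map_location="cpu")
        model.load_state_dict(state, strict=False)
    return model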

Reference

If you use the code from this repo, please cite the following work.

@inproceedings{kocijan2021KBCtransfer,
    title = "Knowledge Base Completion Meets Transfer Learning",
    author = "Kocijan, Vid  and
      Lukasiewicz, Thomas",
    booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP)",
    month = nov,
    year = "2021",
    address = "Punta Cana, Dominican Republic",
    publisher = "Association for Computational Linguistics",
}