PyTorch reimplementation of REALM and ORQA

Last update: Aug 20, 2022

Related tags

Overview

PyTorch Reimplementation of REALM and ORQA

This is PyTorch reimplementation of REALM (paper, codebase) and ORQA (paper, codebase).

Some features have not been implemented yet, currently the predictor and finetuning script are available.

The term retriever and searcher in the code are basically interchangeable, their difference is that retriever is for REALM pretraining, and searcher is for ORQA finetuning.

Prerequisite

cd transformers && pip install -U -e ".[dev]"
pip install -U scann, apache_beam

Data

To download pretrained checkpoints and preprocessed data, please follow the instructions below:

cd data
pip install -U -r requirements.txt
sh download.sh

Finetune (Experimental)

The default finetuning dataset is Natural Question(NQ). To laod your custom dataset, please change the loading function in data.py.

Training:

python run_finetune.py --is_train \
    --model_dir "./" \
    --num_epochs 2 \
    --device cuda

Evaluation:

python run_finetune.py \
    --retriever_pretrained_name "retriever" \
    --checkpoint_pretrained_name "reader" \
    --model_dir "./" \
    --device cuda

Predict

The default checkpoints of retriever and reader are orqa_nq_model_from_realm. To change them, kindly specify --retriever_path and --checkpoint_path.

python predictor.py --question "Who is the pioneer in modern computer science?"

Output: alan mathison turing

License

Apache License 2.0

PyTorch reimplementation of REALM and ORQA

Related tags

Overview

PyTorch Reimplementation of REALM and ORQA

Prerequisite

Data

Finetune (Experimental)

Predict

License

Owner

Li-Huai (Allan) Lin

Annotate with anyone, anywhere.

This repo is a PyTorch implementation for Paper "Unsupervised Learning for Cuboid Shape Abstraction via Joint Segmentation from Point Clouds"

Awesome Remote Sensing Toolkit based on PaddlePaddle.

[ICCV2021] Learning to Track Objects from Unlabeled Videos

GalaXC: Graph Neural Networks with Labelwise Attention for Extreme Classification

A Python package for performing pore network modeling of porous media

TensorFlow (Python) implementation of DeepTCN model for multivariate time series forecasting.

T2F: text to face generation using Deep Learning

NAACL'2021: Factual Probing Is [MASK]: Learning vs. Learning to Recall

Facestar dataset. High quality audio-visual recordings of human conversational speech.

[ICCV'2021] "SSH: A Self-Supervised Framework for Image Harmonization", Yifan Jiang, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Kalyan Sunkavalli, Simon Chen, Sohrab Amirghodsi, Sarah Kong, Zhangyang Wang

Using LSTM write Tang poetry

REGTR: End-to-end Point Cloud Correspondences with Transformers

Source code and notebooks to reproduce experiments and benchmarks on Bias Faces in the Wild (BFW).

AISTATS 2019: Confidence-based Graph Convolutional Networks for Semi-Supervised Learning

A Pytorch implement of paper "Anomaly detection in dynamic graphs via transformer" (TADDY).

GrabGpu_py: a scripts for grab gpu when gpu is free

Official implementation of the paper "AAVAE: Augmentation-AugmentedVariational Autoencoders"

[NeurIPS 2020] Code for the paper "Balanced Meta-Softmax for Long-Tailed Visual Recognition"

A TensorFlow Implementation of "Deep Multi-Scale Video Prediction Beyond Mean Square Error" by Mathieu, Couprie & LeCun.