SAINT PyTorch implementation

Last update: Dec 25, 2022

Overview

SAINT-pytorch

A Simple pyTorch implementation of "Towards an Appropriate Query, Key, and Value Computation for Knowledge Tracing" based on https://arxiv.org/abs/2002.07033.

SAINT: Separated Self-AttentIve Neural Knowledge Tracing. SAINT has an encoder-decoder structure where exercise and response embedding sequence separately enter the encoder and the decoder respectively, which allows to stack attention layers multiple times.

SAINT model architecture

Usage

import torch
import torch.nn as nn
import torch.nn.functional as F
import numpy as np
import copy

from saint import saint, random_data

seq_len = 100
total_ex = 1200
total_cat = 234
total_in = 2

in_ex, in_cat, in_de = random_data(64, 
                                seq_len , 
                                total_ex, 
                                total_cat, 
                                total_in)


model = saint(dim_model=128,
            num_en=6,
            num_de=6,
            heads_en=8,
            heads_de=8,
            total_ex=total_ex,
            total_cat=total_cat,
            total_in=total_in )

outs = model(in_ex, in_cat, in_de)

print(outs.shape)
# torch.Size([64, 100, 1])

Parameters

dim_model: int.
Dimension of model ( embeddings, attention, linear layers).
num_en: int.
Number of encoder layers.
num_de: int.
Number of decoder layers.
heads_en: int.
Number of heads in multi-head attention block in each layer of encoder.
heads_de: int.
Number of heads in multi-head attention block in each layer of decoder.
total_ex: int.
Total number of unique excercise.
total_cat: int.
Total number of unique concept categories.
total_in: int.
Total number of unique interactions.

todo

change positional embedding to sine.

Citations

@article{choi2020towards,
  title={Towards an Appropriate Query, Key, and Value Computation for Knowledge Tracing},
  author={Choi, Youngduck and Lee, Youngnam and Cho, Junghyun and Baek, Jineon and Kim, Byungsoo and Cha, Yeongmin and Shin, Dongmin and Bae, Chan and Heo, Jaewe},
  journal={arXiv preprint arXiv:2002.07033},
  year={2020}
}

@misc{vaswani2017attention,
    title   = {Attention Is All You Need},
    author  = {Ashish Vaswani and Noam Shazeer and Niki Parmar and Jakob Uszkoreit and Llion Jones and Aidan N. Gomez and Lukasz Kaiser and Illia Polosukhin},
    year    = {2017},
    eprint  = {1706.03762},
    archivePrefix = {arXiv},
    primaryClass = {cs.CL}
}

SAINT PyTorch implementation

Related tags

Overview

SAINT-pytorch

SAINT model architecture

Usage

Parameters

todo

Citations

Owner

Arshad Shaikh

Large-scale pretraining for dialogue

Pretrained Japanese BERT models

Awesome Treasure of Transformers Models Collection

Code voor mijn Master project omtrent VideoBERT

Code-autocomplete, a code completion plugin for Python

Google's Meena transformer chatbot implementation

Implementation of some unbalanced loss like focal_loss, dice_loss, DSC Loss, GHM Loss et.al

SimpleChinese2 集成了许多基本的中文NLP功能，使基于 Python 的中文文字处理和信息提取变得简单方便。

Japanese NLP Library

Train GPT-3 model on V100(16GB Mem) Using improved Transformer.

Utility for Google Text-To-Speech batch audio files generator. Ideal for prompt files creation with Google voices for application in offline IVRs

🌐 Translation microservice powered by AI

Official code of our work, Unified Pre-training for Program Understanding and Generation [NAACL 2021].

Sequence-to-Sequence learning using PyTorch

KR-FinBert And KR-FinBert-SC

jiant is an NLP toolkit

Simple GUI where you can enter an article and get a crisp summarized version.

KoBERT - Korean BERT pre-trained cased (KoBERT)

This is the 25 + 1 year anniversary version of the 1995 Rachford-Rice contest

Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.