A deep learning tabular classification architecture inspired by TabTransformer with integrated gated multilayer perceptron.

Last update: Dec 15, 2022

Overview

The GatedTabTransformer.

A deep learning tabular classification architecture inspired by TabTransformer with integrated gated multilayer perceptron. Check out our paper on arXiv.

Usage

import torch
import torch.nn as nn
from gated_tab_transformer import GatedTabTransformer

model = GatedTabTransformer(
    categories = (10, 5, 6, 5, 8),      # tuple containing the number of unique values within each category
    num_continuous = 10,                # number of continuous values
    transformer_dim = 32,               # dimension, paper set at 32
    dim_out = 1,                        # binary prediction, but could be anything
    transformer_depth = 6,              # depth, paper recommended 6
    transformer_heads = 8,              # heads, paper recommends 8
    attn_dropout = 0.1,                 # post-attention dropout
    ff_dropout = 0.1,                   # feed forward dropout
    mlp_act = nn.LeakyReLU(0),          # activation for final mlp, defaults to relu, but could be anything else (selu, etc.)
    mlp_depth=4,                        # mlp hidden layers depth
    mlp_dimension=32,                   # dimension of mlp layers
    gmlp_enabled=True                   # gmlp or standard mlp
)

x_categ = torch.randint(0, 5, (1, 5))   # category values, from 0 - max number of categories, in the order as passed into the constructor above
x_cont = torch.randn(1, 10)             # assume continuous values are already normalized individually

pred = model(x_categ, x_cont)
print(pred)

Citation

@misc{cholakov2022gatedtabtransformer,
      title={The GatedTabTransformer. An enhanced deep learning architecture for tabular modeling}, 
      author={Radostin Cholakov and Todor Kolev},
      year={2022},
      eprint={2201.00199},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

@software{GatedTabTransformer,
  author = {{Radostin Cholakov, Todor Kolev}},
  title = {The GatedTabTransformer.},
  url = {https://github.com/radi-cho/GatedTabTransformer},
  version = {0.0.1},
  date = {2021-12-15},
}

A deep learning tabular classification architecture inspired by TabTransformer with integrated gated multilayer perceptron.

Related tags

Overview

The GatedTabTransformer.

Usage

Citation

Owner

Radi Cho

Marine debris detection with commercial satellite imagery and deep learning.

DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)

ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees

Pytorch implementation for our ICCV 2021 paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering".

A Transformer-Based Feature Segmentation and Region Alignment Method For UAV-View Geo-Localization

Character-Input - Create a program that asks the user to enter their name and their age

A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision for Visual Scene Graph Generation''

Tensorflow 2.x implementation of Panoramic BlitzNet for object detection and semantic segmentation on indoor panoramic images.

PyTorch code for DriveGAN: Towards a Controllable High-Quality Neural Simulation

Subnet Replacement Attack: Towards Practical Deployment-Stage Backdoor Attack on Deep Neural Networks

Datasets, tools, and benchmarks for representation learning of code.

Implementation of "StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis"

GitHub repository for the ICLR Computational Geometry & Topology Challenge 2021

The code for "Deep Level Set for Box-supervised Instance Segmentation in Aerial Images".

PyTorch implementation of the NIPS-17 paper "Poincaré Embeddings for Learning Hierarchical Representations"

OpenCV, MediaPipe Pose Estimation, Affine Transform for Icon Overlay

GAN example for Keras. Cuz MNIST is too small and there should be something more realistic.

Keras like implementation of Deep Learning architectures from scratch using numpy.

Python Tensorflow 2 scripts for detecting objects of any class in an image without knowing their label.

FcaNet: Frequency Channel Attention Networks