DANet for Tabular data classification/ regression.

Last update: Sep 14, 2022

Related tags

Overview

Deep Abstract Networks

A pyTorch implementation for AAAI-2022 paper DANets: Deep Abstract Networks for Tabular Data Classification and Regression.

Brief Introduction

Tabular data are ubiquitous in real world applications. Although many commonly-used neural components (e.g., convolution) and extensible neural networks (e.g., ResNet) have been developed by the machine learning community, few of them were effective for tabular data and few designs were adequately tailored for tabular data structures. In this paper, we propose a novel and flexible neural component for tabular data, called Abstract Layer (AbstLay), which learns to explicitly group correlative input features and generate higher-level features for semantics abstraction. Also, we design a structure re-parameterization method to compress AbstLay, thus reducing the computational complexity by a clear margin in the reference phase. A special basic block is built using AbstLays, and we construct a family of Deep Abstract Networks (DANets) for tabular data classification and regression by stacking such blocks. In DANets, a special shortcut path is introduced to fetch information from raw tabular features, assisting feature interactions across different levels. Comprehensive experiments on real-world tabular datasets show that our AbstLay and DANets are effective for tabular data classification and regression, and the computational complexity is superior to competitive methods.

DANets illustration

Downloads

Dataset

Download the datasets from the following links:

(Optional) Before starting the program, you may change the file format to .pkl by using svm2pkl() or csv2pkl() functions in ./data/data_util.py.

Weights for inference models

The demo weights for Forest Cover Type dataset is available in the folder "./Weights/".

How to use

Setting

Clone or download this repository, and cd the path.
Build a working python environment. Python 3.7 is fine for this repository.
Install packages following the requirements.txt, e.g., by using pip install -r requirements.txt.

Training

Set the hyperparameters in config files (./config/default.py or ./config/*.yaml).
Notably, the hyperparameters in .yaml file will cover those in default.py.
Run by python main.py --c [config_path] --g [gpu_id].
- -c: The config file path
- -g: GPU device ID
The checkpoint models and best models will be saved at the ./logs file.

Inference

Replace the resume_dir path with the file path containing your trained model/weight.
Run codes by using python predict.py -d [dataset_name] -m [model_file_path] -g [gpu_id].
- -d: Dataset name
- -m: Model path for loading
- -g: GPU device ID

Config Hyperparameters

Normal parameters

dataset: str
The dataset name given must match those in ./data/dataset.py.
task: str
Choose one of the pre-given tasks 'classification' and 'regression'.
resume_dir: str
The log path containing the checkpoint models.
logname: str
The directory names of the models save at ./logs.
seed: int
The random seed.

Model parameters

layer: int (default=20)
Number of abstract layers to stack
k: int (default=5)
Number of masks
base_outdim: int (default=64)
The output feature dimension in abstract layer.
drop_rate: float (default=0.1)
Dropout rate in shortcut module

Fit parameters

lr: float (default=0.008)
Learning rate
max_epochs: int (default=5000)
Maximum number of epochs in training.
patience: int (default=1500)
Number of consecutive epochs without improvement before performing early stopping. If patience is set to 0, then no early stopping will be performed.
batch_size: int (default=8192)
Number of examples per batch.
virtual_batch_size: int (default=256)
Size of the mini batches used for "Ghost Batch Normalization". virtual_batch_size must divide batch_size.

Citations

@inproceedings{danets, 
   title={DANets: Deep Abstract Networks for Tabular Data Classification and Regression}, 
   author={Chen, Jintai and Liao, Kuanlun and Wan, Yao and Chen, Danny Z and Wu, Jian}, 
   booktitle={AAAI}, 
   year={2022}
 }

DANet for Tabular data classification/ regression.

Related tags

Overview

Deep Abstract Networks

Brief Introduction

DANets illustration

Downloads

Dataset

Weights for inference models

How to use

Setting

Training

Inference

Config Hyperparameters

Normal parameters

Model parameters

Fit parameters

Citations

Owner

Ronnie Rocket

Event queue (Equeue) dialect is an MLIR Dialect that models concurrent devices in terms of control and structure.

CryptoFrog - My First Strategy for freqtrade

Adversarial Learning for Semi-supervised Semantic Segmentation, BMVC 2018

SketchEdit: Mask-Free Local Image Manipulation with Partial Sketches

Code for CVPR 2021 paper: Anchor-Free Person Search

NaijaSenti is an open-source sentiment and emotion corpora for four major Nigerian languages

Federated Learning Based on Dynamic Regularization

[ACM MM 2019 Oral] Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation

This is a Python Module For Encryption, Hashing And Other stuff

Generative Models for Graph-Based Protein Design

Super-BPD: Super Boundary-to-Pixel Direction for Fast Image Segmentation (CVPR 2020)

Fast (simple) spectral synthesis and emission-line fitting of DESI spectra.

ComputerVision - This repository aims at realized easy network architecture

Official implementation of "SinIR: Efficient General Image Manipulation with Single Image Reconstruction" (ICML 2021)

This repository contains the source code for the paper "DONeRF: Towards Real-Time Rendering of Compact Neural Radiance Fields using Depth Oracle Networks",

python 93% acc. CNN Dogs Vs Cats ( Pytorch )

Self-Supervised Learning

The PyTorch re-implement of a 3D CNN Tracker to extract coronary artery centerlines with state-of-the-art (SOTA) performance. (paper: 'Coronary artery centerline extraction in cardiac CT angiography using a CNN-based orientation classiﬁer')

Pansharpening by convolutional neural networks in the full resolution framework

generate-2D-quadrilateral-mesh-with-neural-networks-and-tree-search