CTAB-GAN

This is the official git paper CTAB-GAN: Effective Table Data Synthesizing. The paper is published on Asian Conference on Machine Learning (ACML 2021), please check our pdf on PMLR website for our newest version of paper, it adds more content on time consumption analysis of training CTAB-GAN. If you have any question, please contact [email protected] for more information.

Example

Experiment_Script_Adult.ipynb is an example notebook for training CTAB-GAN with Adult dataset. The dataset is alread under Real_Datasets folder. The evaluation code is also provided.

For large dataset

If your dataset has large number of column, you may encounter the problem that our currnet code cannot encode all of your data since CTAB-GAN will wrap the encoded data into an image-like format. What you can do is changing the line 341 and 348 in model/synthesizer/ctabgan_synthesizer.py. The number in the slide list

sides = [4, 8, 16, 24, 32]

is the side size of image. You can enlarge the list to [4, 8, 16, 24, 32, 64] or [4, 8, 16, 24, 32, 64, 128] for accepting larger dataset.

Bibtex

To cite this paper, you could use this bibtex

@InProceedings{zhao21,
  title = 	 {CTAB-GAN: Effective Table Data Synthesizing},
  author =       {Zhao, Zilong and Kunar, Aditya and Birke, Robert and Chen, Lydia Y.},
  booktitle = 	 {Proceedings of The 13th Asian Conference on Machine Learning},
  pages = 	 {97--112},
  year = 	 {2021},
  editor = 	 {Balasubramanian, Vineeth N. and Tsang, Ivor},
  volume = 	 {157},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {17--19 Nov},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v157/zhao21a/zhao21a.pdf},
  url = 	 {https://proceedings.mlr.press/v157/zhao21a.html}
}

Official git for "CTAB-GAN: Effective Table Data Synthesizing"

Related tags

Overview

CTAB-GAN

Example

For large dataset

Bibtex

Owner

An implementation demo of the ICLR 2021 paper Neural Attention Distillation: Erasing Backdoor Triggers from Deep Neural Networks in PyTorch.

PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models

Minimal PyTorch implementation of YOLOv3

LogDeep is an open source deeplearning-based log analysis toolkit for automated anomaly detection.

A python module for configuration of block devices

Automatic Idiomatic Expression Detection

Improving Generalization Bounds for VC Classes Using the Hypergeometric Tail Inversion

:fire: 2D and 3D Face alignment library build using pytorch

Network Pruning That Matters: A Case Study on Retraining Variants (ICLR 2021)

3rd Place Solution of the Traffic4Cast Core Challenge @ NeurIPS 2021

[NeurIPS'21] "AugMax: Adversarial Composition of Random Augmentations for Robust Training" by Haotao Wang, Chaowei Xiao, Jean Kossaifi, Zhiding Yu, Animashree Anandkumar, and Zhangyang Wang.

House_prices_kaggle - Predict sales prices and practice feature engineering, RFs, and gradient boosting

[ICCV 2021] Our work presents a novel neural rendering approach that can efficiently reconstruct geometric and neural radiance fields for view synthesis.

StarGAN - Official PyTorch Implementation (CVPR 2018)

Pytorch Performace Tuning, WandB, AMP, Multi-GPU, TensorRT, Triton

Train neural network for semantic segmentation (deep lab V3) with pytorch in less then 50 lines of code

Old Photo Restoration (Official PyTorch Implementation)

HHP-Net: A light Heteroscedastic neural network for Head Pose estimation with uncertainty

Retrieve and analysis data from SDSS (Sloan Digital Sky Survey)

Prometheus Exporter for data scraped from datenplattform.darmstadt.de