Codes for Causal Semantic Generative model (CSG), the model proposed in "Learning Causal Semantic Representation for Out-of-Distribution Prediction" (NeurIPS-21)

Last update: Dec 01, 2022

Overview

Learning Causal Semantic Representation for Out-of-Distribution Prediction

This repository is the official implementation of "Learning Causal Semantic Representation for Out-of-Distribution Prediction" (NeurIPS 2021).

Chang Liu <[email protected]>, Xinwei Sun, Jindong Wang, Haoyue Tang, Tao Li, Tao Qin, Wei Chen, Tie-Yan Liu.
[Paper & Appendix] [Slides] [Video] [Poster]

Introduction

The work proposes a Causal Semantic Generative model (CSG) for OOD generalization (single-source domain generalization) and domain adaptation. The model is developed following a causal reasoning process, and prediction is made by leveraging the causal invariance principle. Training and prediction algorithms are developed based on variational Bayes with a novel design. Theoretical guarantees on the identifiability of the causal factor and the benefits for OOD prediction are presented.

This codebase implements the CSG methods, and implements or integrates various baselines. Most domain adaptation baselines (except BNM) use the dalib package. The experiment setups on the PACS and VLCS datasets are adopted from the domainbed repository. Authorships are clarified in each file or module.

Requirements

The code requires python version >= 3.6, and is based on PyTorch. To install requirements:

pip install -r requirements.txt

Usage

Folder a-mnist contains scripts to run the experiments on the Shifted-MNIST dataset, and a-imageclef on the ImageCLEF-DA dataset, and a-domainbed on the PACS and VLCS datasets (the prefix a- represents "application").

Go to the respective folder and run the prepare_data.sh or makedata.sh script there to prepare the datasets. Run the run_ood.sh (for OOD generalization methods) and run_da.sh (for domain adaptation methods) scripts to train the models. Evaluation result (accuracy on the test domain) is printed and written to disk with the model and configurations. See the commands in the script files or python3 main.py --help for customized usage or hyperparameter tuning.

Codes for Causal Semantic Generative model (CSG), the model proposed in "Learning Causal Semantic Representation for Out-of-Distribution Prediction" (NeurIPS-21)

Related tags

Overview

Learning Causal Semantic Representation for Out-of-Distribution Prediction

Introduction

Requirements

Usage

Owner

Chang Liu

Implementing a simplified copy of Shazam application from scratch using MinHashing and LSH.

🛠 All-in-one web-based IDE specialized for machine learning and data science.

PFLD pytorch Implementation

A pytorch &keras implementation and demo of Fastformer.

💡 Learnergy is a Python library for energy-based machine learning models.

VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition

Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research

DAT4 - General Assembly's Data Science course in Washington, DC

Code for sound field predictions in domains with impedance boundaries. Used for generating results from the paper

The implementation of the lifelong infinite mixture model

This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

RepVGG: Making VGG-style ConvNets Great Again

Remote sensing change detection tool based on PaddlePaddle

Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020

E2C implementation in PyTorch

EdiBERT, a generative model for image editing

Deep Q-Learning Network in pytorch (not actively maintained)

An implementation of EWC with PyTorch

Wav2Vec for speech recognition, classification, and audio classification

Quick program made to generate alpha and delta tables for Hidden Markov Models