Instance-wise Feature Importance in Time (FIT)

FIT is a framework for explaining time series perdiction models, by assigning feature importance to every observation over time. paper

To run the experiments, you need a trained prediction model that takes in time series data as input, and generates a prediction over time. You also need the training data to train the FIT generator. Below are the instruction for replicating experiments in the paper.

Data preparation

Two different simulated datasets are used in the experiments. The process of creating the data is explained below.

Simulated dataset (State data):

Run the following script to create the data and the ground thruth explanations for the state experiment. You can choose the total number of samples in the dataset as well as the lenght of each recording. The defaults are set to 1000 samples of length 100.

python3 data_generator/state_data.py --signal_len LENGTH_OF_SIGNALS --signal_num TOTAL_NUMBER_OF_SAMPLES

Simulated dataset (Spike data):

python3 data_generator/simulations_threshold_spikes.py

MIMIC ICU dataset:

You need to have the MIMICIII database running on a server. Run the following scripts to query and preprocess the ICU mortality data (This step might take a few hours)

python3 data_generator/icu_mortality.py --sqluser YOUR_USER --sqlpass YOUR_PASSWORD

Run the following scripts to query and preprocess the ICU mortality data (This step might take a few hours)

python3 data_generator/icu_mortality.py ---sqluser YOUR_USER --sqlpass YOUR_PASSWORD

Running the importance assignment baselines

For running the experiments, you need to train: 1) The black-box predictor model and 2) the conditional generator. You can do this by passing the --train argument. If a model and conditional generator is already trained, skip the '--train' argument. To generate explanations for test samples using any of the baselines and for your required dataset (simulation, simulation_spike, mimic), run the following module.

python3 -m evaluation.baselines --data DATASET_NAME --explainer EXPLAINER_MODEL --train

In addition to FIT, you can also run experiments on different baseline explainers such as retain, deep lift, feature occlusion, etc.

Instance-wise Feature Importance in Time (FIT)

Related tags

Overview

Instance-wise Feature Importance in Time (FIT)

Data preparation

Simulated dataset (State data):

Simulated dataset (Spike data):

MIMIC ICU dataset:

Running the importance assignment baselines

Owner

Sana

MOpt-AFL provided by the paper "MOPT: Optimized Mutation Scheduling for Fuzzers"

Lip Reading - Cross Audio-Visual Recognition using 3D Convolutional Neural Networks

Implementation of Analyzing and Improving the Image Quality of StyleGAN (StyleGAN 2) in PyTorch

PiRank: Learning to Rank via Differentiable Sorting

Autotype on websites that have copy-paste disabled like Moodle, HackerEarth contest etc.

Code for Multinomial Diffusion

Learning cell communication from spatial graphs of cells

TagLab: an image segmentation tool oriented to marine data analysis

Chainer Implementation of Semantic Segmentation using Adversarial Networks

MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.

LoL Runes Recommender With Python

PyTorch implementation of DirectCLR from paper Understanding Dimensional Collapse in Contrastive Self-supervised Learning

PyTorch implementation of PP-LCNet

Simple image captioning model - CLIP prefix captioning.

Python-based Informatics Kit for Analysing Chemical Units

一些经典的CTR算法的复现; LR, FM, FFM, AFM, DeepFM，xDeepFM, PNN, DCN, DCNv2, DIFM, AutoInt, FiBiNet,AFN,ONN,DIN, DIEN ... （pytorch, tf2.0）

Clairvoyance: a Unified, End-to-End AutoML Pipeline for Medical Time Series

An Unsupervised Detection Framework for Chinese Jargons in the Darknet

Train Dense Passage Retriever (DPR) with a single GPU

On the Analysis of French Phonetic Idiosyncrasies for Accent Recognition