CausaLM: Causal Model Explanation Through Counterfactual Language Models

Last update: Jul 10, 2022

Overview

CausaLM: Causal Model Explanation Through Counterfactual Language Models

Authors:

Amir Feder, Nadav Oved, Uri Shalit, Roi Reichart

Abstract:

Understanding predictions made by deep neural networks is notoriously difficult, but also crucial to their dissemination. As all ML-based methods, they are as good as their training data, and can also capture unwanted biases. While there are tools that can help understand whether such biases exist, they do not distinguish between correlation and causation, and might be ill-suited for text-based models and for reasoning about high level language concepts. A key problem of estimating the causal effect of a concept of interest on a given model is that this estimation requires the generation of counterfactual examples, which is challenging with existing generation technology. To bridge that gap, we propose CausaLM, a framework for producing causal model explanations using counterfactual language representation models. Our approach is based on fine-tuning of deep contextualized embedding models with auxiliary adversarial tasks derived from the causal graph of the problem. Concretely, we show that by carefully choosing auxiliary adversarial pre-training tasks, language representation models such as BERT can effectively learn a counterfactual representation for a given concept of interest, and be used to estimate its true causal effect on model performance. A byproduct of our method is a representation that is unaffected by the tested concept, which can be useful in mitigating unwanted bias ingrained in the data.

CausaLM: Causal Model Explanation Through Counterfactual Language Models

Related tags

Overview

CausaLM: Causal Model Explanation Through Counterfactual Language Models

Authors:

Amir Feder, Nadav Oved, Uri Shalit, Roi Reichart

Abstract:

Links:

Paper

Code

Data

Owner

Amir Feder

1st Solution For ICDAR 2021 Competition on Mathematical Formula Detection

Deep Learning Emotion decoding using EEG data from Autism individuals

This repository collects 100 papers related to negative sampling methods.

Forecasting directional movements of stock prices for intraday trading using LSTM and random forest

Modelisation on galaxy evolution using PEGASE-HR

PyTorch source code for Distilling Knowledge by Mimicking Features

PyTorch implementation of MoCo v3 for self-supervised ResNet and ViT.

[ICCV '21] In this repository you find the code to our paper Keypoint Communities

The official code repository for examples in the O'Reilly book 'Generative Deep Learning'

Tools for computational pathology

In this work, we will implement some basic but important algorithm of machine learning step by step.

Nicely is a real-time Feedback and Intervention Program Depression is a prevalent issue across all age groups, socioeconomic classes, and cultural identities.

Machine Learning Framework for Operating Systems - Brings ML to Linux kernel

A module for solving and visualizing Schrödinger equation.

Repo for EchoVPR: Echo State Networks for Visual Place Recognition

face_recognization (FaceNet) + TFHE (HNP) + hand_face_detection (Mediapipe)

JudeasRx - graphical app for doing personalized causal medicine using the methods invented by Judea Pearl et al.

Official Matlab Implementation for "Tiny Obstacle Discovery by Occlusion-aware Multilayer Regression", TIP 2020

基于AlphaPose的TensorRT加速

Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing (CVPR 2018).