CausaLM: Causal Model Explanation Through Counterfactual Language Models

Last update: Jul 10, 2022

Overview

Authors:

Amir Feder, Nadav Oved, Uri Shalit, Roi Reichart

Abstract:

Understanding predictions made by deep neural networks is notoriously difficult, but also crucial to their dissemination. Like all ML-based methods, they are only as good as their training data, and can also capture unwanted biases. While there are tools that can help understand whether such biases exist, they do not distinguish between correlation and causation, and might be ill-suited for text-based models and for reasoning about high-level language concepts. A key problem of estimating the causal effect of a concept of interest on a given model is that this estimation requires the generation of counterfactual examples, which is challenging with existing generation technology. To bridge that gap, we propose CausaLM, a framework for producing causal model explanations using counterfactual language representation models. Our approach is based on fine-tuning of deep contextualized embedding models with auxiliary adversarial tasks derived from the causal graph of the problem. Concretely, we show that by carefully choosing auxiliary adversarial pre-training tasks, language representation models such as BERT can effectively learn a counterfactual representation for a given concept of interest, and be used to estimate its true causal effect on model performance. A byproduct of our method is a representation that is unaffected by the tested concept, which can be useful in mitigating unwanted bias ingrained in the data.
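
The abstract describes estimating a concept's effect by comparing a model that uses the original representation with one that uses a counterfactual representation. The sketch below illustrates one way such a comparison could be computed, assuming the effect is summarized as the mean L1 distance between the two classifiers' predicted class-probability vectors over the same examples. The function name `average_representation_effect` and all numbers are hypothetical and are not taken from the CausaLM code base.

```python
from typing import Sequence


def average_representation_effect(
    original_probs: Sequence[Sequence[float]],
    counterfactual_probs: Sequence[Sequence[float]],
) -> float:
    """Mean L1 distance between paired class-probability vectors.

    original_probs[i] is the prediction of a classifier built on the original
    representation for example i; counterfactual_probs[i] is the prediction of
    a classifier built on the counterfactual representation for the same
    example. (Assumed setup, for illustration only.)
    """
    if len(original_probs) != len(counterfactual_probs):
        raise ValueError("both inputs must cover the same examples")
    if not original_probs:
        raise ValueError("at least one example is required")

    total = 0.0
    for p_orig, p_cf in zip(original_probs, counterfactual_probs):
        if len(p_orig) != len(p_cf):
            raise ValueError("probability vectors must have the same length")
        # L1 distance between the two predicted distributions for one example.
        total += sum(abs(a - b) for a, b in zip(p_orig, p_cf))
    return total / len(original_probs)


# Hypothetical predictions for two examples and two classes.
original = [[0.9, 0.1], [0.2, 0.8]]
counterfactual = [[0.6, 0.4], [0.3, 0.7]]
effect = average_representation_effect(original, counterfactual)
print(round(effect, 3))
```

A value near zero would suggest that removing the concept from the representation barely changes the predictions; a larger value would suggest the model relies on it. How the counterfactual representation itself is trained (the adversarial fine-tuning step) is outside this sketch.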

Links:

Paper

Code

Data

Owner

Amir Feder
