Behavioral Testing of Clinical NLP Models

This repository contains code for testing the behavior of clinical prediction models based on patient letters. For a detailed description of the testing framework see our paper What Do You See in this Patient? Behavioral Testing of Clinical NLP Models.

Usage

Install requirements: pip install -r requirements.txt

Run main.py, e.g. for diagnosis prediction test on gender, age and ethnicity:

python main.py 
    --test_set_path ./path_to_test_set
    --model_path bvanaken/CORe-clinical-diagnosis-prediction
    --task diagnosis
    --shift_keys gender,age,ethnicity
    --save_dir ./results
    --gpu False

Parameter	Description
test_set_path	Path to original test set file
model_path	Path to model or Huggingface model hub checkpoint
task	Current options: diagnosis, mortality
shift_keys	Which patient characteristics to test. Current options: age, gender, ethnicity, weight, intersectional (gender + ethnicity)
save_dir	Directory to save results, default: "./results"
gpu	Whether to use a gpu during inference or not, default: False

Using Non-Transformer models

The framework currently focuses on testing Transformer-based models. However, it is easy to extend it to any other prediction model. To do so, simply create a new class implementing the Predictor interface and add it to the TASK_MAP in main.py.

Cite

@inproceedings{vanAken2021,
  author    = {Betty van Aken and
               Sebastian Herrmann and
               Alexander Löser},
  title     = {What Do You See in this Patient? Behavioral Testing of Clinical NLP Models},
  booktitle = {Bridging the Gap: From Machine Learning Research to Clinical Practice, 
               Research2Clinics Workshop @ NeurIPS 2021},
  year      = {2021}
}

Behavioral Testing of Clinical NLP Models

Related tags

Overview

Behavioral Testing of Clinical NLP Models

Usage

Using Non-Transformer models

Cite

Owner

Betty van Aken

Spooky Skelly For Python

NLP Core Library and Model Zoo based on PaddlePaddle 2.0

Pytorch-Named-Entity-Recognition-with-BERT

Library for fast text representation and classification.

SentAugment is a data augmentation technique for semi-supervised learning in NLP.

Ask for weather information like a human

Malware-Related Sentence Classification

Basic yet complete Machine Learning pipeline for NLP tasks

Binaural Speech Synthesis

Dust model dichotomous performance analysis

Entity Disambiguation as text extraction (ACL 2022)

DiY Oxygen Concentrator based on the OxiKit

Yet Another Sequence Encoder - Encode sequences to vector of vector in python !

An implementation of the Pay Attention when Required transformer

Th2En & Th2Zh: The large-scale datasets for Thai text cross-lingual summarization

Text vectorization tool to outperform TFIDF for classification tasks

CCF BDCI 2020 房产行业聊天问答匹配赛道 A榜47/2985

LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)

DeepPavlov Tutorials

Python implementation of TextRank for phrase extraction and summarization of text documents