Dataset for the Research2Clinics @ NeurIPS 2021 Paper: What Do You See in this Patient? Behavioral Testing of Clinical NLP Models

Last update: Sep 20, 2022

Overview

Behavioral Testing of Clinical NLP Models

This repository contains code for testing the behavior of clinical prediction models based on patient letters. For a detailed description of the testing framework, see our paper "What Do You See in this Patient? Behavioral Testing of Clinical NLP Models".
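The general idea of such a behavioral test can be sketched as follows: a patient characteristic mentioned in the letter is altered, and the model's predictions on the original and the shifted text are compared. This is a minimal illustration under stated assumptions, not the repository's implementation. The names `shift_gender`, `prediction_shift`, `toy_model` and the replacement table are hypothetical.

```python
import re
from typing import Callable, Dict

# Hypothetical replacement table (assumption); deliberately tiny and naive,
# e.g. it maps "her" to "his" regardless of grammatical role.
FEMALE_TO_MALE: Dict[str, str] = {
    "she": "he",
    "her": "his",
    "woman": "man",
    "female": "male",
}


def shift_gender(text: str, mapping: Dict[str, str]) -> str:
    """Replace whole-word mentions found in `mapping`, ignoring case."""
    pattern = re.compile(
        r"\b(" + "|".join(map(re.escape, mapping)) + r")\b", re.IGNORECASE
    )

    def swap(match: re.Match) -> str:
        word = match.group(0)
        replacement = mapping[word.lower()]
        # Keep the capitalization of the first letter.
        return replacement.capitalize() if word[0].isupper() else replacement

    return pattern.sub(swap, text)


def prediction_shift(model: Callable[[str], float], original: str, shifted: str) -> float:
    """Difference between the model score on the shifted and the original text."""
    return model(shifted) - model(original)


def toy_model(text: str) -> float:
    """Stand-in for a clinical model (assumption): a deliberately biased scorer."""
    return 0.7 if re.search(r"\bmale\b", text, re.IGNORECASE) else 0.4


letter = "58 year old female presents with chest pain. She reports nausea."
shifted_letter = shift_gender(letter, FEMALE_TO_MALE)
delta = prediction_shift(toy_model, letter, shifted_letter)
print(shifted_letter)
print(f"prediction shift: {delta:+.2f}")
```

A non-zero shift on letters that differ only in the stated characteristic indicates that the model's output depends on that characteristic.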

Usage

Install requirements: pip install -r requirements.txt

Run main.py, e.g. for diagnosis prediction test on gender, age and ethnicity:

python main.py \
    --test_set_path ./path_to_test_set \
    --model_path bvanaken/CORe-clinical-diagnosis-prediction \
    --task diagnosis \
    --shift_keys gender,age,ethnicity \
    --save_dir ./results \
    --gpu False

Parameter	Description
test_set_path	Path to original test set file
model_path	Path to a local model or a Hugging Face model hub checkpoint
task	Current options: diagnosis, mortality
shift_keys	Which patient characteristics to test. Current options: age, gender, ethnicity, weight, intersectional (gender + ethnicity)
save_dir	Directory to save results, default: "./results"
gpu	Whether to use a GPU during inference, default: False
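How parameters like these might be parsed and validated can be sketched as below. This is an illustrative assumption, not the contents of main.py: `build_parser`, `parse_shift_keys` and `str_to_bool` are hypothetical helpers. The explicit boolean converter reflects a general argparse pitfall: with `type=bool`, the string "False" would evaluate to True.

```python
import argparse
from typing import List

# Options as listed in the parameter table above.
VALID_TASKS = ("diagnosis", "mortality")
VALID_SHIFT_KEYS = ("age", "gender", "ethnicity", "weight", "intersectional")


def str_to_bool(value: str) -> bool:
    """Convert 'True'/'False'-style strings to a real boolean."""
    lowered = value.strip().lower()
    if lowered in ("true", "1", "yes"):
        return True
    if lowered in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def parse_shift_keys(value: str) -> List[str]:
    """Split a comma-separated list and reject unknown keys."""
    keys = [key.strip() for key in value.split(",") if key.strip()]
    unknown = [key for key in keys if key not in VALID_SHIFT_KEYS]
    if unknown:
        raise argparse.ArgumentTypeError(f"unknown shift keys: {unknown}")
    return keys


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented parameters."""
    parser = argparse.ArgumentParser(description="Behavioral testing sketch")
    parser.add_argument("--test_set_path", required=True)
    parser.add_argument("--model_path", required=True)
    parser.add_argument("--task", choices=VALID_TASKS, required=True)
    parser.add_argument("--shift_keys", type=parse_shift_keys, required=True)
    parser.add_argument("--save_dir", default="./results")
    parser.add_argument("--gpu", type=str_to_bool, default=False)
    return parser


args = build_parser().parse_args([
    "--test_set_path", "./path_to_test_set",
    "--model_path", "bvanaken/CORe-clinical-diagnosis-prediction",
    "--task", "diagnosis",
    "--shift_keys", "gender,age,ethnicity",
    "--gpu", "False",
])
print(args.shift_keys, args.save_dir, args.gpu)
```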

Using Non-Transformer models

The framework currently focuses on testing Transformer-based models. However, it is easy to extend it to any other prediction model. To do so, simply create a new class implementing the Predictor interface and add it to the TASK_MAP in main.py.
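The extension step described above could look roughly like the sketch below. The actual methods required by the Predictor interface and the structure of TASK_MAP are not shown in this README, so both are assumptions here: the sketch defines its own stand-in `Predictor` with a single hypothetical `predict_batch` method, a toy `KeywordPredictor`, and a `TASK_MAP` that maps task names to predictor classes. Check the repository source for the real signatures before adapting it.

```python
from abc import ABC, abstractmethod
from typing import Dict, List, Type


class Predictor(ABC):
    """Hypothetical stand-in for the repository's Predictor interface."""

    @abstractmethod
    def predict_batch(self, texts: List[str]) -> List[List[float]]:
        """Return one list of label scores per input text."""


class KeywordPredictor(Predictor):
    """Toy non-Transformer model: scores each label by keyword coverage."""

    def __init__(self, keywords_per_label: Dict[str, List[str]]) -> None:
        self.keywords_per_label = keywords_per_label

    def predict_batch(self, texts: List[str]) -> List[List[float]]:
        results = []
        for text in texts:
            lowered = text.lower()
            scores = []
            for keywords in self.keywords_per_label.values():
                # Fraction of this label's keywords that occur in the text.
                hits = sum(1 for keyword in keywords if keyword in lowered)
                scores.append(hits / len(keywords))
            results.append(scores)
        return results


# Assumed shape of the registry: task name -> predictor class.
TASK_MAP: Dict[str, Type[Predictor]] = {"diagnosis": KeywordPredictor}

predictor = TASK_MAP["diagnosis"](
    {"cardiac": ["chest pain", "angina"], "respiratory": ["cough"]}
)
scores = predictor.predict_batch(["Patient reports chest pain and cough."])
print(scores)
```

Keeping the model behind a small interface like this means the testing code only needs text in and scores out, regardless of how the model is implemented.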

Cite

@inproceedings{vanAken2021,
  author    = {Betty van Aken and
               Sebastian Herrmann and
               Alexander Löser},
  title     = {What Do You See in this Patient? Behavioral Testing of Clinical NLP Models},
  booktitle = {Bridging the Gap: From Machine Learning Research to Clinical Practice, 
               Research2Clinics Workshop @ NeurIPS 2021},
  year      = {2021}
}
