SurvTRACE: Transformers for Survival Analysis with Competing Events

Last update: Oct 06, 2022

Overview


This repo provides the implementation of SurvTRACE for survival analysis. It takes only a few lines of code to use:

from survtrace.dataset import load_data
from survtrace.model import SurvTraceSingle
from survtrace import Evaluator
from survtrace import Trainer
from survtrace import STConfig

# use METABRIC dataset
STConfig['data'] = 'metabric'
df, df_train, df_y_train, df_test, df_y_test, df_val, df_y_val = load_data(STConfig)

# initialize model
model = SurvTraceSingle(STConfig)

# execute training
trainer = Trainer(model)
trainer.fit((df_train, df_y_train), (df_val, df_y_val))

# evaluating
evaluator = Evaluator(df, df_train.index)
evaluator.eval(model, (df_test, df_y_test))

print("done!")

🔥 See the demo

Please refer to experiment_metabric.ipynb and experiment_support.ipynb!

🔥 How to configure the environment

Use our pre-saved conda environment!

conda env create --name survtrace --file=survtrace.yml
conda activate survtrace

Or install the dependencies from requirements.txt:

pip3 install -r requirements.txt

🔥 How to get SEER data

Go to https://seer.cancer.gov/data/ and submit a data request to SEER, following the guide there.
After completing that step, you should have the SEER*Stat software for data access. Open it and sign in with the username and password sent by SEER.

Use SEER*Stat to open the ./data/seer.sl file.

Click the 'execute' icon to query the SEER database. This produces a csv file.

Move the csv file to ./data/seer_raw.csv, then run the Python script process_seer.py:
```
python process_seer.py
```
This produces the processed SEER data, named seer_processed.csv.
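
The processing step depends on the raw export being at ./data/seer_raw.csv. The sketch below is an optional wrapper, not part of the repo, that checks for that file before launching process_seer.py. The function name `run_seer_processing` and its parameters are hypothetical. It assumes you call it from the directory that contains process_seer.py.

```python
import subprocess
import sys
from pathlib import Path


def run_seer_processing(raw_csv="./data/seer_raw.csv", script="process_seer.py"):
    """Run the processing script only if the raw SEER export is in place.

    Hypothetical helper: the repo itself only documents `python process_seer.py`.
    """
    raw_path = Path(raw_csv)
    if not raw_path.is_file():
        raise FileNotFoundError(
            f"{raw_path} not found; export the csv from SEER*Stat and move it there first"
        )
    # Use the current interpreter so the script runs in the active environment.
    subprocess.run([sys.executable, script], check=True)
```

Calling `run_seer_processing()` with the defaults is equivalent to the command above, except that a missing raw file is reported before the script starts.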

📝 Functions

single event survival analysis
competing events survival analysis
multi-task learning
automatic hyperparameter grid-search
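
The list above mentions automatic hyperparameter grid-search, but this README does not show its interface. As a rough illustration of the idea only, here is a generic grid search over a dict-style config, similar in spirit to how `STConfig['data']` is set in the quick-start. It is not the repo's implementation: `grid_search`, `toy_score`, and the keys `learning_rate` and `hidden_size` are all hypothetical, and the scoring function is a stand-in for training and validating a model.

```python
import copy
import itertools


def grid_search(base_config, grid, train_and_score):
    """Try every combination in `grid` and return the best (config, score).

    base_config     : dict-like configuration, copied for every trial
    grid            : dict mapping a config key to a list of candidate values
    train_and_score : callable(config) -> float, where higher is better
    """
    keys = list(grid)
    best_config, best_score = None, float("-inf")
    for values in itertools.product(*(grid[k] for k in keys)):
        config = copy.deepcopy(base_config)  # never mutate the shared base config
        config.update(zip(keys, values))
        score = train_and_score(config)
        if score > best_score:
            best_config, best_score = config, score
    return best_config, best_score


# Stand-in for "train a model and return a validation metric" (illustration only).
def toy_score(config):
    return -abs(config["learning_rate"] - 1e-3) - 0.001 * config["hidden_size"]


best, score = grid_search(
    {"data": "metabric"},
    {"learning_rate": [1e-4, 1e-3, 1e-2], "hidden_size": [16, 32]},
    toy_score,
)
print(best, score)
```

In practice `train_and_score` would build a model from the config, fit it on the training split, and return a validation metric. The search cost grows with the product of the candidate list lengths, so keep the grid small.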

😄 If you find this result interesting, please consider citing this paper:

@article{wang2021survtrace,
      title={Surv{TRACE}: Transformers for Survival Analysis with Competing Events}, 
      author={Zifeng Wang and Jimeng Sun},
      year={2021},
      eprint={2110.00855},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}
