Predicting Axillary Lymph Node Metastasis in Early Breast Cancer Using Deep Learning on Primary Tumor Biopsy Slides

[email protected]">

Last update: Dec 28, 2022

Overview

Predicting Axillary Lymph Node Metastasis in Early Breast Cancer Using Deep Learning on Primary Tumor Biopsy Slides

This repo is the official implementation of our paper "Predicting Axillary Lymph Node Metastasis in Early Breast Cancer Using Deep Learning on Primary Tumor Biopsy Slides".

Our paper is accepted by Frontiers in Oncology, and you can also get access our paper from MedRxiv.

Abstract

Objectives: To develop and validate a deep learning (DL)-based primary tumor biopsy signature for predicting axillary lymph node (ALN) metastasis preoperatively in early breast cancer (EBC) patients with clinically negative ALN.
Methods: A total of 1,058 EBC patients with pathologically confirmed ALN status were enrolled from May 2010 to August 2020. A DL core-needle biopsy (DL-CNB) model was built on the attention-based multiple instance-learning (AMIL) framework to predict ALN status utilizing the DL features, which were extracted from the cancer areas of digitized whole-slide images (WSIs) of breast CNB specimens annotated by two pathologists. Accuracy, sensitivity, specificity, receiver operating characteristic (ROC) curves, and areas under the ROC curve (AUCs) were analyzed to evaluate our model.
Results: The best-performing DL-CNB model with VGG16_BN as the feature extractor achieved an AUC of 0.816 (95% confidence interval (CI): 0.758, 0.865) in predicting positive ALN metastasis in the independent test cohort. Furthermore, our model incorporating the clinical data, which was called DL-CNB+C, yielded the best accuracy of 0.831 (95% CI: 0.775, 0.878), especially for patients younger than 50 years (AUC: 0.918, 95% CI: 0.825, 0.971). The interpretation of DL-CNB model showed that the top signatures most predictive of ALN metastasis were characterized by the nucleus features including density (p = 0.015), circumference (p = 0.009), circularity (p = 0.010), and orientation (p = 0.012).
Conclusion: Our study provides a novel DL-based biomarker on primary tumor CNB slides to predict the metastatic status of ALN preoperatively for patients with EBC.

Data

Our data includes whole slide images (WSIs) of breast cancer patients and the corresponding clinical data. According to the axillary lymph node (ALN) metastasis, 1058 patients are divided into the following 3 categories:

N0: having no positive lymph nodes (655 patients, 61.9%).
N+(1~2): having one or two positive lymph nodes (210 patients, 19.8%).
N+(>2): having three or more positive lymph nodes (193 patients, 18.3%).

Here we have provided some WSI samples and clinical data samples, you can review our paper for more details.

For full access to the BALNMP Dataset, please contact us and the usage of BALNMP Dataset must follow the license.

WSI samples

N0

N+(1~2)

N+(>2)

Clinical Data Samples

Pre-Trained Models

Please download pre-trained models from here.

Demo Software

We have also provided software for easily checking the performance of our model to predict ALN metastasis.

Please download the software from here, and check the README.txt for usage. Please note that this software is only used for demo, and it cannot be used for other purposes.

Citation

Please cite our paper in your publications if it helps your research.

@article{xu2021predicting,
  title={Predicting Axillary Lymph Node Metastasis in Early Breast Cancer Using Deep Learning on Primary Tumor Biopsy Slides},
  author={Xu, Feng and Zhu, Chuang and Tang, Wenqi and Wang, Ying and Zhang, Yu and Li, Jie and Jiang, Hongchuan and Shi, Zhongyue and Liu, Jun and Jin, Mulan},
  journal={Frontiers in Oncology},
  pages={4133},
  year={2021},
  publisher={Frontiers}
}

License

This BALNMP Dataset is made freely available to academic and non-academic entities for non-commercial purposes such as academic research, teaching, scientific publications, or personal experimentation. Permission is granted to use the data given that you agree to our license terms bellow:

That you include a reference to the BALNMP Dataset in any work that makes use of the dataset. For research papers, cite our preferred publication as listed on our website; for other media cite our preferred publication as listed on our website or link to the BALNMP website.
That you do not distribute this dataset or modified versions. It is permissible to distribute derivative works in as far as they are abstract representations of this dataset (such as models trained on it or additional annotations that do not directly include any of our data).
That you may not use the dataset or any derivative work for commercial purposes as, for example, licensing or selling the data, or using the data with a purpose to procure a commercial gain.
That all rights not expressly granted to you are reserved by us.

Contact

email: [email protected], [email protected], [email protected]

Predicting Axillary Lymph Node Metastasis in Early Breast Cancer Using Deep Learning on Primary Tumor Biopsy Slides

Related tags

Overview

Predicting Axillary Lymph Node Metastasis in Early Breast Cancer Using Deep Learning on Primary Tumor Biopsy Slides

Abstract

Data

WSI samples

N0

N+(1~2)

N+(>2)

Clinical Data Samples

Pre-Trained Models

Demo Software

Citation

License

Contact

Owner

CVSM Group - email: [email protected]

Implementation of UNet on the Joey ML framework

StyleMapGAN - Official PyTorch Implementation

DeceFL: A Principled Decentralized Federated Learning Framework

Code base for the paper "Scalable One-Pass Optimisation of High-Dimensional Weight-Update Hyperparameters by Implicit Differentiation"

Real time sign language recognition

CarND-LaneLines-P1 - Lane Finding Project for Self-Driving Car ND

GradAttack is a Python library for easy evaluation of privacy risks in public gradients in Federated Learning

A weakly-supervised scene graph generation codebase. The implementation of our CVPR2021 paper ``Linguistic Structures as Weak Supervision for Visual Scene Graph Generation''

Intro-to-dl - Resources for "Introduction to Deep Learning" course.

A benchmark framework for Tensorflow

These are the materials for the paper "Few-Shot Out-of-Domain Transfer Learning of Natural Language Explanations"

How Do Adam and Training Strategies Help BNNs Optimization? In ICML 2021.

Goal of the project : Detecting Temporal Boundaries in Sign Language videos

CSKG is a commonsense knowledge graph that combines seven popular sources into a consolidated representation

Learning Energy-Based Models by Diffusion Recovery Likelihood

Large scale embeddings on a single machine.

A series of Python scripts to access measurements from Fluke 28X meters. Fluke IR Remote Interface required.

Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator

Implementing DeepMind's Fast Reinforcement Learning paper

A PyTorch implementation of the paper Mixup: Beyond Empirical Risk Minimization in PyTorch