A Graph-Transformer for Whole Slide Image Classification

Bibliographic Details
Published in:	IEEE transactions on medical imaging 2022-11, Vol.41 (11), p.3003-3015
Main Authors:	Zheng, Yi; Gindra, Rushin H.; Green, Emily J.; Burks, Eric J.; Betke, Margrit; Beane, Jennifer E.; Kolachalama, Vijaya B.
Format:	Article
Language:	English
Subjects:	Adenocarcinoma; Cancer; Classification; Deep learning; Digital pathology; Feature extraction; Gene mapping; Genomes; graph convolutional network; Graphical representations; Guanosine Triphosphate; Image classification; Image processing; Image Processing, Computer-Assisted - methods; Lung; lung cancer; Machine learning; Medical imaging; Pathology; Proteomics; Squamous cell carcinoma; Training; Transformers; Tumors; vision transformer

Description
Summary:	Deep learning is a powerful tool for whole slide image (WSI) analysis. Typically, in supervised deep learning, a WSI is divided into small patches, a model is trained on those patches, and the patch-level outcomes are aggregated to estimate disease grade. However, patch-based methods introduce label noise during training by assuming that each patch is independent and carries the same label as the WSI, and they neglect overall WSI-level information that is significant in disease grading. Here we present a Graph-Transformer (GT) that fuses a graph-based representation of a WSI with a vision transformer for processing pathology images, called GTP, to predict disease grade. We selected 4,818 WSIs from the Clinical Proteomic Tumor Analysis Consortium (CPTAC), the National Lung Screening Trial (NLST), and The Cancer Genome Atlas (TCGA), and used GTP to distinguish adenocarcinoma (LUAD) and squamous cell carcinoma (LSCC) from adjacent non-cancerous tissue (normal). First, using NLST data, we developed a contrastive learning framework to generate a feature extractor. This allowed us to compute feature vectors of individual WSI patches, which were used to represent the nodes of the graph, followed by construction of the GTP framework. Our model trained on the CPTAC data achieved consistently high performance on three-label classification (normal versus LUAD versus LSCC: mean accuracy = 91.2 ± 2.5%) based on five-fold cross-validation, and mean accuracy = 82.3 ± 1.0% on external test data (TCGA). We also introduced a graph-based saliency mapping technique, called GraphCAM, that can identify regions that are highly associated with the class label. Our findings demonstrate GTP as an interpretable and effective deep learning framework for WSI-level classification.
ISSN:	0278-0062 1558-254X
DOI:	10.1109/TMI.2022.3176598
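
Illustrative note: the summary describes representing each WSI patch as a graph node carrying a feature vector. The sketch below shows one way such a patch graph could be assembled. It is not the authors' implementation: the function name `build_patch_graph`, the 8-neighbourhood edge rule, the toy coordinates, and the random stand-in features are all assumptions made for illustration, since the summary does not state how edges are defined.

```python
import numpy as np


def build_patch_graph(coords):
    """Return a symmetric 0/1 adjacency matrix for patches on a grid.

    coords: (n, 2) integer (row, col) grid positions of the patches.
    Two patches are linked when they touch, diagonals included
    (8-neighbourhood). This edge rule is an assumption, not taken
    from the paper summary.
    """
    coords = np.asarray(coords, dtype=int)
    # Pairwise Chebyshev distance between grid positions.
    diff = np.abs(coords[:, None, :] - coords[None, :, :]).max(axis=-1)
    # Distance exactly 1 means the patches are neighbours; 0 is the patch itself.
    return (diff == 1).astype(np.uint8)


# Toy example: four patches, the last one isolated from the others.
coords = [(0, 0), (0, 1), (1, 1), (3, 3)]
# Stand-in node features; in practice these would come from a feature extractor.
features = np.random.default_rng(0).normal(size=(len(coords), 8))
adjacency = build_patch_graph(coords)
```

Here `features` holds one row per node and `adjacency` records which patches are spatial neighbours; together they form the kind of graph input that a graph-based model would consume.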