Loading…

Conv-Swinformer: Integration of CNN and shift window attention for Alzheimer’s disease classification

Deep learning (DL) algorithms based on brain MRI images have achieved great success in the prediction of Alzheimer’s disease (AD), with classification accuracy exceeding even that of the most experienced clinical experts. As a novel feature fusion method, Transformer has achieved excellent performan...

Full description

Saved in:
Bibliographic Details
Published in:Computers in biology and medicine 2023-09, Vol.164, p.107304-107304, Article 107304
Main Authors: Hu, Zhentao, Li, Yanyang, Wang, Zheng, Zhang, Shuo, Hou, Wei
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Deep learning (DL) algorithms based on brain MRI images have achieved great success in the prediction of Alzheimer’s disease (AD), with classification accuracy exceeding even that of the most experienced clinical experts. As a novel feature fusion method, Transformer has achieved excellent performance in many computer vision tasks, which also greatly promotes the application of Transformer in medical images. However, when Transformer is used for 3D MRI image feature fusion, existing DL models treat the input local features equally, which is inconsistent with the fact that adjacent voxels have stronger semantic connections than spatially distant voxels. In addition, due to the relatively small size of the dataset for medical images, it is difficult to capture local lesion features in limited iterative training by treating all input features equally. This paper proposes a deep learning model Conv-Swinformer that focuses on extracting and integrating local fine-grained features. Conv-Swinformer consists of a CNN module and a Transformer encoder module. The CNN module summarizes the planar features of the MRI slices, and the Transformer module establishes semantic connections in 3D space for these planar features. By introducing the shift window attention mechanism in the Transformer encoder, the attention is focused on a small spatial area of the MRI image, which effectively reduces unnecessary background semantic information and enables the model to capture local features more accurately. In addition, the layer-by-layer enlarged attention window can further integrate local fine-grained features, thus enhancing the model’s attention ability. Compared with DL algorithms that indiscriminately fuse local features of MRI images, Conv-Swinformer can fine-grained extract local lesion features, thus achieving better classification results. •Combining CNN and Transformer to build a classification model for Alzheimer’s disease.•A shift window attention mechanism is introduced in Transformer to enhance the extraction of local features of MRI.•The performance of the model was compared on MRI slices in three directions.•The proposed model is validated on ADNI and OASIS.•The performance of the model under different data enhancement methods is compared.
ISSN:0010-4825
1879-0534
DOI:10.1016/j.compbiomed.2023.107304