Loading…
Efficient self-attention mechanism and structural distilling model for Alzheimer’s disease diagnosis
Structural magnetic resonance imaging (sMRI) is commonly used for the identification of Alzheimer’s disease because of its keen insight into atrophy-induced changes in brain structure. Current mainstream convolutional neural network-based deep learning methods ignore the long-term dependencies betwe...
Saved in:
Published in: | Computers in biology and medicine 2022-08, Vol.147, p.105737-105737, Article 105737 |
---|---|
Main Authors: | , , , , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Structural magnetic resonance imaging (sMRI) is commonly used for the identification of Alzheimer’s disease because of its keen insight into atrophy-induced changes in brain structure. Current mainstream convolutional neural network-based deep learning methods ignore the long-term dependencies between voxels; thus, it is challenging to learn the global features of sMRI data. In this study, an advanced deep learning architecture called Brain Informer (BraInf) was developed based on an efficient self-attention mechanism. The proposed model integrates representation learning, feature distilling, and classifier modeling into a unified framework. First, the proposed model uses a multihead ProbSparse self-attention block for representation learning. This self-attention mechanism selects the first ⌊lnN⌋ elements that can represent the overall features from the perspective of probability sparsity, which significantly reduces computational cost. Subsequently, a structural distilling block is proposed that applies the concept of patch merging to the distilling operation. The block reduces the size of the three-dimensional tensor and further lowers the memory cost while preserving the original data as much as possible. Thus, there was a significant improvement in the space complexity. Finally, the feature vector was projected into the classification target space for disease prediction. The effectiveness of the proposed model was validated using the Alzheimer’s Disease Neuroimaging Initiative dataset. The model achieved 97.97% and 91.89% accuracy on Alzheimer’s disease and mild cognitive impairment classification tasks, respectively. The experimental results also demonstrate that the proposed framework outperforms several state-of-the-art methods.
•Self-attention mechanism can capture long-term dependencies of MRI brain regions.•Structural distilling reduces memory cost and improves classification performance.•Significant performance improvement is validated compared with mainstream methods.•The proposed model used a data-driven method without relying on prior knowledge. |
---|---|
ISSN: | 0010-4825 1879-0534 |
DOI: | 10.1016/j.compbiomed.2022.105737 |