Loading…
OFGST-Swin: Swin Transformer Utilizing Overlap Fusion-Based Generalized S-Transform for Respiratory Cycle Classification
Respiratory diseases pose a massive threat to human health; thus, early diagnosis and treatment are essential. Although electronic stethoscopes have shown effectiveness in enhancing auscultation, the diagnosis still necessitates the expertise of a specialist. In this article, we propose a Swin Trans...
Saved in:
Published in: | IEEE transactions on instrumentation and measurement 2024, Vol.73, p.1-13 |
---|---|
Main Authors: | , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | Respiratory diseases pose a massive threat to human health; thus, early diagnosis and treatment are essential. Although electronic stethoscopes have shown effectiveness in enhancing auscultation, the diagnosis still necessitates the expertise of a specialist. In this article, we propose a Swin Transformer utilizing overlap fusion-based generalized S-transform (OFGST-Swin) for respiratory cycle classification. The proposed OFGST-Swin demonstrates the capability to categorize respiratory sounds captured by electronic stethoscopes and detect adventitious respiratory cycles within these recordings, and it consists of two novel modules: the sliding window-based augmentation (SWA) for respiratory cycle data enhancement, and the overlap fusion-based generalized S-transform (OFGST) for respiratory cycle feature extraction. The SWA addresses data imbalance in medical datasets by generating adventitious respiratory cycles through a sliding window. The OFGST incorporates the innovative triangular window-based overlap fusion (TWOF) into the enhanced generalized S-transform (EGST), for extracting respiratory cycle features. The proposed OFGST-Swin has been evaluated on two datasets: the ICBHI 2017 dataset and the SPRsound respiratory sound dataset. The experimental results indicate that the proposed OFGST-Swin achieves a better accuracy score of 0.5605 on four-category classification tasks in the ICBHI 2017 dataset, and 0.8018 on seven-category classification tasks in the SPRsound dataset. The proposed method, serving as a signal processing backend for electronic stethoscopes, offers highly effective diagnostic advice to physicians. |
---|---|
ISSN: | 0018-9456 1557-9662 |
DOI: | 10.1109/TIM.2024.3428637 |