Loading…

Visual interpretation of deep learning model in ECG classification: A comprehensive evaluation of feature attribution methods

Feature attribution methods can visually highlight specific input regions containing influential aspects affecting a deep learning model's prediction. Recently, the use of feature attribution methods in electrocardiogram (ECG) classification has been sharply increasing, as they assist clinician...

Full description

Saved in:

Bibliographic Details
Published in:	Computers in biology and medicine 2024-11, Vol.182, p.109088, Article 109088
Main Authors:	Suh, Jangwon, Kim, Jimyeong, Kwon, Soonil, Jung, Euna, Ahn, Hyo-Jeong, Lee, Kyung-Yeon, Choi, Eue-Keun, Rhee, Wonjong
Format:	Article
Language:	English
Subjects:	Cardiac arrhythmia Cardiology Classification Databases, Factual Datasets Decision making Deep Learning EKG Electrocardiogram Electrocardiography Electrocardiography - methods Experiments Explainable artificial intelligence Feature attribution Heart Humans Interpretability Localization Medical research Methods Signal Processing, Computer-Assisted Visual aspects Visual discrimination learning
Citations:	Items that this one cites
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Feature attribution methods can visually highlight specific input regions containing influential aspects affecting a deep learning model's prediction. Recently, the use of feature attribution methods in electrocardiogram (ECG) classification has been sharply increasing, as they assist clinicians in understanding the model's decision-making process and assessing the model's reliability. However, a careful study to identify suitable methods for ECG datasets has been lacking, leading researchers to select methods without a thorough understanding of their appropriateness. In this work, we conduct a large-scale assessment by considering eleven popular feature attribution methods across five large ECG datasets using a model based on the ResNet-18 architecture. Our experiments include both automatic evaluations and human evaluations. Annotated datasets were utilized for automatic evaluations and three cardiac experts were involved for human evaluations. We found that Guided Grad-CAM, particularly when its absolute values are utilized, achieves the best performance. When Guided Grad-CAM was utilized as the feature attribution method, cardiac experts confirmed that it can identify diagnostically relevant electrophysiological characteristics, although its effectiveness varied across the 17 different diagnoses that we have investigated. •A large-scale assessment of feature attribution methods is provided.•Eleven feature attribution methods are considered over five large ECG datasets.•Both automatic and human evaluations are performed.•In our experiments, Guided Grad-CAM exhibited outstanding performance.
ISSN:	0010-4825 1879-0534 1879-0534
DOI:	10.1016/j.compbiomed.2024.109088