Loading…

Natural language processing of implantable cardioverter-defibrillator reports in hypertrophic cardiomyopathy: A paradigm for longitudinal device follow-up

The follow-up of implantable cardioverter-defibrillators (ICDs) generates large amounts of valuable structured and unstructured data embedded in device interrogation reports. We aimed to build a natural language processing (NLP) model for automated capture of ICD-recorded events from device interrog...

Full description

Saved in:
Bibliographic Details
Published in:Cardiovascular digital health journal 2021-10, Vol.2 (5), p.264-269
Main Authors: Siontis, Konstantinos C., Bhopalwala, Huzefa, Dewaswala, Nakeya, Scott, Christopher G., Noseworthy, Peter A., Geske, Jeffrey B., Ommen, Steve R., Nishimura, Rick A., Ackerman, Michael J., Friedman, Paul A., Arruda-Olson, Adelaide M.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The follow-up of implantable cardioverter-defibrillators (ICDs) generates large amounts of valuable structured and unstructured data embedded in device interrogation reports. We aimed to build a natural language processing (NLP) model for automated capture of ICD-recorded events from device interrogation reports using a single-center cohort of patients with hypertrophic cardiomyopathy (HCM). A total of 687 ICD interrogation reports from 247 HCM patients were included. Using a derivation set of 480 reports, we developed a rule-based NLP algorithm based on unstructured (free-text) data from the interpretation field of the ICD reports to identify sustained atrial and ventricular arrhythmias, and ICD therapies. A separate model based on structured numerical tabulated data was also developed. Both models were tested in a separate set of the 207 remaining ICD reports. Diagnostic performance was determined in reference to arrhythmia and ICD therapy annotations generated by expert manual review of the same reports. The NLP system achieved sensitivity 0.98 and 0.99, and F1-scores 0.98 and 0.92 for arrhythmia and ICD therapy events, respectively. In contrast, the performance of the structured data model was significantly lower with sensitivity 0.33 and 0.76, and F1-scores 0.45 and 0.78, for arrhythmia and ICD therapy events, respectively. An automated NLP system can capture arrhythmia events and ICD therapies from unstructured device interrogation reports with high accuracy in HCM. These findings demonstrate the feasibility of an NLP paradigm for the extraction of data for clinical care and research from ICD reports embedded in the electronic health record.
ISSN:2666-6936
2666-6936
DOI:10.1016/j.cvdhj.2021.05.005