Loading…

Detecting anomalous sequences in electronic health records using higher-order tensor networks

Detecting anomalous sequences is an integral part of building and protecting modern large-scale health information technology (HIT) systems. These HIT systems generate a large volume of records of patients’ state and significant events, which provide a valuable resource to help improve clinical deci...

Full description

Saved in:
Bibliographic Details
Published in:Journal of biomedical informatics 2022-11, Vol.135, p.104219-104219, Article 104219
Main Authors: Niu, Haoran, Omitaomu, Olufemi A., Langston, Michael A., Olama, Mohammad, Ozmen, Ozgur, Klasky, Hilda B., Laurio, Angela, Sauer, Brian, Ward, Merry, Nebeker, Jonathan
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Detecting anomalous sequences is an integral part of building and protecting modern large-scale health information technology (HIT) systems. These HIT systems generate a large volume of records of patients’ state and significant events, which provide a valuable resource to help improve clinical decisions, patient care processes, and other issues. However, detecting anomalous sequences in electronic health records (EHR) remains a challenge in healthcare applications for several reasons, including imbalances in the data, complexity of relationships between events in the sequence, and the curse of dimensionality. Conventional anomaly detection methods use the finite sequence of events to discriminate sequences. They fail to incorporate salient event details under variable higher-order dependencies (e.g., duration between events) that can provide better discrimination of sequences in their models. To address this problem, we propose event sequence and subsequence anomaly detection algorithms that (1) use network-based representations of interactions in the data, (2) account for variable higher-order dependencies in the data, and (3) incorporate events duration for adequate discrimination of the data. The proposed approach identifies anomalies by monitoring the change in the graph after the test sequence is removed from the network. The change is quantified using graph distance metrics so that dramatic changes in the network can be attributed to the removed sequence. Furthermore, the proposed subsequence algorithm recommends plausible paths and salient information for the detected anomalous subsequences. Our results show that the proposed event sequence anomaly detection algorithm outperforms the baseline methods for both synthetic data and real-world EHR data. [Display omitted] •The detection of anomalous sequences in EHR remains a challenge in healthcare.•We propose new anomalous sequence detection algorithms for healthcare applications.•The algorithms consider EHR data as a graph and use higher-order dependencies.•The algorithms identify anomalies based on dramatic changes in the graph distance.•Our solution outperforms the baseline methods that used first-order dependency.
ISSN:1532-0464
1532-0480
DOI:10.1016/j.jbi.2022.104219