Loading…

Deep representation learning of patient data from Electronic Health Records (EHR): A systematic review

[Display omitted] •A systematic review of the current works pertinent to patient representation learning.•A growing trend in building deep learning based patient representations from EHRs.•The learned representations attempt to gain a cohesive picture of a patient’s data.•Capabilities of deep learni...

Full description

Saved in:
Bibliographic Details
Published in:Journal of biomedical informatics 2021-03, Vol.115, p.103671-103671, Article 103671
Main Authors: Si, Yuqi, Du, Jingcheng, Li, Zhao, Jiang, Xiaoqian, Miller, Timothy, Wang, Fei, Jim Zheng, W., Roberts, Kirk
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:[Display omitted] •A systematic review of the current works pertinent to patient representation learning.•A growing trend in building deep learning based patient representations from EHRs.•The learned representations attempt to gain a cohesive picture of a patient’s data.•Capabilities of deep learning models can largely address the challenges of EHR data.•Future work: advanced learning methods to obtain robust, and precise representations. Patient representation learning refers to learning a dense mathematical representation of a patient that encodes meaningful information from Electronic Health Records (EHRs). This is generally performed using advanced deep learning methods. This study presents a systematic review of this field and provides both qualitative and quantitative analyses from a methodological perspective. We identified studies developing patient representations from EHRs with deep learning methods from MEDLINE, EMBASE, Scopus, the Association for Computing Machinery (ACM) Digital Library, and the Institute of Electrical and Electronics Engineers (IEEE) Xplore Digital Library. After screening 363 articles, 49 papers were included for a comprehensive data collection. Publications developing patient representations almost doubled each year from 2015 until 2019. We noticed a typical workflow starting with feeding raw data, applying deep learning models, and ending with clinical outcome predictions as evaluations of the learned representations. Specifically, learning representations from structured EHR data was dominant (37 out of 49 studies). Recurrent Neural Networks were widely applied as the deep learning architecture (Long short-term memory: 13 studies, Gated recurrent unit: 11 studies). Learning was mainly performed in a supervised manner (30 studies) optimized with cross-entropy loss. Disease prediction was the most common application and evaluation (31 studies). Benchmark datasets were mostly unavailable (28 studies) due to privacy concerns of EHR data, and code availability was assured in 20 studies. The existing predictive models mainly focus on the prediction of single diseases, rather than considering the complex mechanisms of patients from a holistic review. We show the importance and feasibility of learning comprehensive representations of patient EHR data through a systematic review. Advances in patient representation learning techniques will be essential for powering patient-level EHR analyses. Future work will still be devoted to l
ISSN:1532-0464
1532-0480
1532-0480
DOI:10.1016/j.jbi.2020.103671