Loading…

A Novel Deep Similarity Learning Approach to Electronic Health Records Data

The past decade has seen a tremendous advancement in using Electronic Health Records (EHRs) to offer clinical decision support and provide personalized healthcare to patients. Despite the potential benefits offered by EHR data, it is challenging to represent and analyze large EHRs for predictive mod...

Full description

Saved in:
Bibliographic Details
Published in:IEEE access 2020, Vol.8, p.209278-209295
Main Authors: Gupta, Vagisha, Sachdeva, Shelly, Bhalla, Subhash
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The past decade has seen a tremendous advancement in using Electronic Health Records (EHRs) to offer clinical decision support and provide personalized healthcare to patients. Despite the potential benefits offered by EHR data, it is challenging to represent and analyze large EHRs for predictive modeling due to heterogeneity, high dimensionality, and sparsity. This article proposes a novel supervised Deep Similarity Learning approach that learns the patient representations and also finds the relationship between the patients using pairwise similarity learning to facilitate predictive analysis for personalized healthcare. We develop CNN_Softmax which is a Siamese-based neural network for multi-class classification methods corresponding to the prediction of disease. It uses Convolutional Neural Network (CNN) to study the vector representation of raw EHRs and capture essential information of patient features, and a Softmax-based supervised classification method that learns the similarity between pairs of patients and performs disease prediction using this similarity information. Our approach uses data type mapping to handle heterogeneity and the polynomial interpolation method to handle sparsity existing in EHR data. ORBDA, which is an openEHR (standard) benchmark dataset, is used for evaluating this study. Experimental results show that CNN_Softmax achieves an accuracy of 97.8%, a recall of 98.1%, a precision of 96.02%, and an F1 score of 97.82%. The comparative results show that our proposed novel methodology performs disease prediction with highly promising results and outperforms state-of-the-art similarity learning methods. The current study is the first attempt to perform disease prediction on standardized EHRs, to the best of the authors' knowledge. The deep similarity learning approach provides support for clinical decision making that is more reliable and generalizable than previous approaches and focuses on dealing with heterogeneous and sparse data. The concept also serves as a new implementation of artificial intelligence technologies for the application of clinical big data.
ISSN:2169-3536
2169-3536
DOI:10.1109/ACCESS.2020.3037710