Loading…

PheValuator: Development and evaluation of a phenotype algorithm evaluator

[Display omitted] •Phenotype Algorithms (PAs) are used in research to determine presence of disease.•Evaluation of PAs for sensitivity/specificity/predictive values is rarely performed.•PheValuator uses diagnostic predictive modeling to perform PA evaluation.•The tool provides conservative sensitivi...

Full description

Saved in:
Bibliographic Details
Published in:Journal of biomedical informatics 2019-09, Vol.97, p.103258-103258, Article 103258
Main Authors: Swerdel, Joel N., Hripcsak, George, Ryan, Patrick B.
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:[Display omitted] •Phenotype Algorithms (PAs) are used in research to determine presence of disease.•Evaluation of PAs for sensitivity/specificity/predictive values is rarely performed.•PheValuator uses diagnostic predictive modeling to perform PA evaluation.•The tool provides conservative sensitivity/specificity/predictive estimates for PAs.•PheValuator shows promise as a tool to assess PA performance characteristics. The primary approach for defining disease in observational healthcare databases is to construct phenotype algorithms (PAs), rule-based heuristics predicated on the presence, absence, and temporal logic of clinical observations. However, a complete evaluation of PAs, i.e., determining sensitivity, specificity, and positive predictive value (PPV), is rarely performed. In this study, we propose a tool (PheValuator) to efficiently estimate a complete PA evaluation. We used 4 administrative claims datasets: OptumInsight’s de-identified Clinformatics™ Datamart (Eden Prairie,MN); IBM MarketScan Multi-State Medicaid); IBM MarketScan Medicare Supplemental Beneficiaries; and IBM MarketScan Commercial Claims and Encounters from 2000 to 2017. Using PheValuator involves (1) creating a diagnostic predictive model for the phenotype, (2) applying the model to a large set of randomly selected subjects, and (3) comparing each subject’s predicted probability for the phenotype to inclusion/exclusion in PAs. We used the predictions as a ‘probabilistic gold standard’ measure to classify positive/negative cases. We examined 4 phenotypes: myocardial infarction, cerebral infarction, chronic kidney disease, and atrial fibrillation. We examined several PAs for each phenotype including 1-time (1X) occurrence of the diagnosis code in the subject’s record and 1-time occurrence of the diagnosis in an inpatient setting with the diagnosis code as the primary reason for admission (1X-IP-1stPos). Across phenotypes, the 1X PA showed the highest sensitivity/lowest PPV among all PAs. 1X-IP-1stPos yielded the highest PPV/lowest sensitivity. Specificity was very high across algorithms. We found similar results between algorithms across datasets. PheValuator appears to show promise as a tool to estimate PA performance characteristics.
ISSN:1532-0464
1532-0480
DOI:10.1016/j.jbi.2019.103258