A metadata framework for computational phenotypes

Bibliographic Details
Published in: JAMIA Open 2023-07, Vol.6 (2), p.ooad032-ooad032
Main Authors: Spotnitz, Matthew; Acharya, Nripendra; Cimino, James J; Murphy, Shawn; Namjou, Bahram; Crimmins, Nancy; Walunas, Theresa; Liu, Cong; Crosslin, David; Benoit, Barbara; Rosenthal, Elisabeth; Pacheco, Jennifer A; Ostropolets, Anna; Reyes Nieva, Harry; Patterson, Jason S; Richter, Lauren R; Callahan, Tiffany J; Elhussein, Ahmed; Pang, Chao; Kiryluk, Krzysztof; Nestor, Jordan; Khan, Atlas; Mohan, Sumit; Minty, Evan; Chung, Wendy; Wei, Wei-Qi; Natarajan, Karthik; Weng, Chunhua
Format: Article
Language: English
Description
Summary:

Abstract. With the burgeoning development of computational phenotypes, it is increasingly difficult to identify the right phenotype for the right task. This study uses a mixed-methods approach to develop and evaluate a novel metadata framework for retrieving and reusing computational phenotypes. Twenty active phenotyping researchers from 2 large research networks, Electronic Medical Records and Genomics (eMERGE) and Observational Health Data Sciences and Informatics, were recruited to suggest metadata elements. Once consensus was reached on 39 metadata elements, 47 new researchers were surveyed to evaluate the utility of the metadata framework. The survey consisted of 5-point Likert-scale multiple-choice questions and open-ended questions. Two more researchers were asked to use the metadata framework to annotate 8 type 2 diabetes mellitus phenotypes. More than 90% of the survey respondents rated metadata elements regarding phenotype definition and validation methods and metrics positively, with a score of 4 or 5. Both researchers completed annotation of each phenotype within 60 min. Our thematic analysis of the narrative feedback indicates that the metadata framework was effective in capturing rich and explicit descriptions and in enabling the search for phenotypes, compliance with data standards, and comprehensive validation metrics. Current limitations were its complexity for data collection and the entailed human costs.

Lay Summary. Computational phenotypes are essential to scale precision medicine and observational healthcare research. More comprehensive and explicitly defined phenotype metadata could improve phenotype retrieval, reuse, and sharing. However, few studies have focused directly on phenotype metadata explicitness or validation methods and metrics. We designed a phenotype metadata framework as part of ongoing research with the Electronic Medical Records and Genomics (eMERGE) network phenotyping working group. We identified 39 metadata elements based on group consensus. We distributed a survey to 47 new researchers, who rated the usefulness of each metadata element on a scale of 1–5, and conducted a thematic analysis of the free-text survey questions. Two researchers annotated 8 type 2 diabetes mellitus phenotypes with the framework. More than 90% of respondents assigned a rating of 4–5 to metadata framework elements regarding phenotype definition and validation metrics. In our thematic analysis, explicit descriptions, compliance with data standards, and comprehens
ISSN: 2574-2531
DOI: 10.1093/jamiaopen/ooad032