Loading…

Comparative evaluation of multiomics integration tools for the study of prediabetes: insights into the earliest stages of type 2 diabetes mellitus

Type 2 diabetes mellitus (T2D) remains a critical health concern, particularly in its early disease stages such as prediabetes. Understanding these early stages is paramount for improving patient outcomes. Multiomics data integration tools offer promise in unraveling the underlying mechanisms of T2D...

Full description

Saved in:
Bibliographic Details
Published in:Network modeling and analysis in health informatics and bioinformatics (Wien) 2024-03, Vol.13 (1), p.8, Article 8
Main Authors: Emam, Mohamed, Tarek, Ahmed, Soudy, Mohamed, Antunes, Agostinho, Hadidi, Mohamed El, Hamed, Mohamed
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c270t-8cb4d0bfff1e3ededdc99d86ecb4070ce7570004280fc033a4c103500ea06d13
container_end_page
container_issue 1
container_start_page 8
container_title Network modeling and analysis in health informatics and bioinformatics (Wien)
container_volume 13
creator Emam, Mohamed
Tarek, Ahmed
Soudy, Mohamed
Antunes, Agostinho
Hadidi, Mohamed El
Hamed, Mohamed
description Type 2 diabetes mellitus (T2D) remains a critical health concern, particularly in its early disease stages such as prediabetes. Understanding these early stages is paramount for improving patient outcomes. Multiomics data integration tools offer promise in unraveling the underlying mechanisms of T2D. The advent of high-throughput technology and the increasing availability of multiomics data has led to the development of several statistical and network-based integration methods. However, the performance of such methods varies, requiring their output evaluation in an unbiased manner. Here, we conducted a comparative analysis of three represented unsupervised multiomics integration tools, MOFA + , GFA, and ICluster alongside an in-house supervised model EMFR, using two complementary benchmarks. First, we assessed how well the features selected by each tool could discriminate between patient and control samples using both linear and nonlinear classification models. Second, we quantified how much each type of omics data-selected features contributed to the total variance. Through such detailed comparisons between the unsupervised, we observed that the features selected by MOFA + and GFA gave the best F1 score (0.7) in the nonlinear classification model, clearly discriminating between patient and control classes. Hence, we recommend these two unsupervised integration tools for feature selection purposes. Our comparative analyses were conducted on a real biological dataset to further study prediabetes patients. Such multiomics data enabled the detection of prediabetes subtypes and provided several clinical insights that will open a new gate toward the era of personalized medicine for diabetic disease.
doi_str_mv 10.1007/s13721-024-00442-9
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2956972082</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2956972082</sourcerecordid><originalsourceid>FETCH-LOGICAL-c270t-8cb4d0bfff1e3ededdc99d86ecb4070ce7570004280fc033a4c103500ea06d13</originalsourceid><addsrcrecordid>eNp9kF1rwyAUhsPYYKXrH9iVsOtsR_NhsrtR9gWD3fRerJ6klqRmagr9G_vFs83GdjURPOjzHPVNkmsKtxSA33macUZTYHkKkOcsrc-SGaM1S8uSw_mf-jJZeL-FOKo4aTFLPpe2H6STweyR4F52YyztjtiG9GMXy94oT8wuYOumk2Bt50ljHQkbJD6M-nCkB4fayDUG9PeR96bdhJNoTxxK1xn0IQqyRX80wmFAwsiPRXrsOhNGf5VcNLLzuPhe58nq6XG1fEnf3p9flw9vqWIcQlqpda5h3TQNxQw1aq3qWlclxn3goJAXPH40ZxU0CrJM5opCVgCghFLTbJ7cTG0HZz_G-DSxtaPbxRsFq4uy5gwqFik2UcpZ7x02YnCml-4gKIhj-mJKX8T0xSl9UUcpmyQf4V2L7rf1P9YXbceLLA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2956972082</pqid></control><display><type>article</type><title>Comparative evaluation of multiomics integration tools for the study of prediabetes: insights into the earliest stages of type 2 diabetes mellitus</title><source>Springer Link</source><creator>Emam, Mohamed ; Tarek, Ahmed ; Soudy, Mohamed ; Antunes, Agostinho ; Hadidi, Mohamed El ; Hamed, Mohamed</creator><creatorcontrib>Emam, Mohamed ; Tarek, Ahmed ; Soudy, Mohamed ; Antunes, Agostinho ; Hadidi, Mohamed El ; Hamed, Mohamed</creatorcontrib><description>Type 2 diabetes mellitus (T2D) remains a critical health concern, particularly in its early disease stages such as prediabetes. Understanding these early stages is paramount for improving patient outcomes. Multiomics data integration tools offer promise in unraveling the underlying mechanisms of T2D. The advent of high-throughput technology and the increasing availability of multiomics data has led to the development of several statistical and network-based integration methods. However, the performance of such methods varies, requiring their output evaluation in an unbiased manner. Here, we conducted a comparative analysis of three represented unsupervised multiomics integration tools, MOFA + , GFA, and ICluster alongside an in-house supervised model EMFR, using two complementary benchmarks. First, we assessed how well the features selected by each tool could discriminate between patient and control samples using both linear and nonlinear classification models. Second, we quantified how much each type of omics data-selected features contributed to the total variance. Through such detailed comparisons between the unsupervised, we observed that the features selected by MOFA + and GFA gave the best F1 score (0.7) in the nonlinear classification model, clearly discriminating between patient and control classes. Hence, we recommend these two unsupervised integration tools for feature selection purposes. Our comparative analyses were conducted on a real biological dataset to further study prediabetes patients. Such multiomics data enabled the detection of prediabetes subtypes and provided several clinical insights that will open a new gate toward the era of personalized medicine for diabetic disease.</description><identifier>ISSN: 2192-6670</identifier><identifier>ISSN: 2192-6662</identifier><identifier>EISSN: 2192-6670</identifier><identifier>DOI: 10.1007/s13721-024-00442-9</identifier><language>eng</language><publisher>Vienna: Springer Vienna</publisher><subject>Applications of Graph Theory and Complex Networks ; Bioinformatics ; Biological analysis ; Classification ; Comparative analysis ; Computational Biology/Bioinformatics ; Computer Science ; Cytokines ; Data integration ; Datasets ; Diabetes ; Diabetes mellitus ; Diabetes mellitus (non-insulin dependent) ; Feature selection ; Health Informatics ; Integration ; Methods ; Original Article ; Precision medicine</subject><ispartof>Network modeling and analysis in health informatics and bioinformatics (Wien), 2024-03, Vol.13 (1), p.8, Article 8</ispartof><rights>The Author(s), under exclusive licence to Springer-Verlag GmbH Austria, part of Springer Nature 2024 Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><rights>Copyright Springer Nature B.V. Dec 2024</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c270t-8cb4d0bfff1e3ededdc99d86ecb4070ce7570004280fc033a4c103500ea06d13</cites><orcidid>0000-0002-1328-1732</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Emam, Mohamed</creatorcontrib><creatorcontrib>Tarek, Ahmed</creatorcontrib><creatorcontrib>Soudy, Mohamed</creatorcontrib><creatorcontrib>Antunes, Agostinho</creatorcontrib><creatorcontrib>Hadidi, Mohamed El</creatorcontrib><creatorcontrib>Hamed, Mohamed</creatorcontrib><title>Comparative evaluation of multiomics integration tools for the study of prediabetes: insights into the earliest stages of type 2 diabetes mellitus</title><title>Network modeling and analysis in health informatics and bioinformatics (Wien)</title><addtitle>Netw Model Anal Health Inform Bioinforma</addtitle><description>Type 2 diabetes mellitus (T2D) remains a critical health concern, particularly in its early disease stages such as prediabetes. Understanding these early stages is paramount for improving patient outcomes. Multiomics data integration tools offer promise in unraveling the underlying mechanisms of T2D. The advent of high-throughput technology and the increasing availability of multiomics data has led to the development of several statistical and network-based integration methods. However, the performance of such methods varies, requiring their output evaluation in an unbiased manner. Here, we conducted a comparative analysis of three represented unsupervised multiomics integration tools, MOFA + , GFA, and ICluster alongside an in-house supervised model EMFR, using two complementary benchmarks. First, we assessed how well the features selected by each tool could discriminate between patient and control samples using both linear and nonlinear classification models. Second, we quantified how much each type of omics data-selected features contributed to the total variance. Through such detailed comparisons between the unsupervised, we observed that the features selected by MOFA + and GFA gave the best F1 score (0.7) in the nonlinear classification model, clearly discriminating between patient and control classes. Hence, we recommend these two unsupervised integration tools for feature selection purposes. Our comparative analyses were conducted on a real biological dataset to further study prediabetes patients. Such multiomics data enabled the detection of prediabetes subtypes and provided several clinical insights that will open a new gate toward the era of personalized medicine for diabetic disease.</description><subject>Applications of Graph Theory and Complex Networks</subject><subject>Bioinformatics</subject><subject>Biological analysis</subject><subject>Classification</subject><subject>Comparative analysis</subject><subject>Computational Biology/Bioinformatics</subject><subject>Computer Science</subject><subject>Cytokines</subject><subject>Data integration</subject><subject>Datasets</subject><subject>Diabetes</subject><subject>Diabetes mellitus</subject><subject>Diabetes mellitus (non-insulin dependent)</subject><subject>Feature selection</subject><subject>Health Informatics</subject><subject>Integration</subject><subject>Methods</subject><subject>Original Article</subject><subject>Precision medicine</subject><issn>2192-6670</issn><issn>2192-6662</issn><issn>2192-6670</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kF1rwyAUhsPYYKXrH9iVsOtsR_NhsrtR9gWD3fRerJ6klqRmagr9G_vFs83GdjURPOjzHPVNkmsKtxSA33macUZTYHkKkOcsrc-SGaM1S8uSw_mf-jJZeL-FOKo4aTFLPpe2H6STweyR4F52YyztjtiG9GMXy94oT8wuYOumk2Bt50ljHQkbJD6M-nCkB4fayDUG9PeR96bdhJNoTxxK1xn0IQqyRX80wmFAwsiPRXrsOhNGf5VcNLLzuPhe58nq6XG1fEnf3p9flw9vqWIcQlqpda5h3TQNxQw1aq3qWlclxn3goJAXPH40ZxU0CrJM5opCVgCghFLTbJ7cTG0HZz_G-DSxtaPbxRsFq4uy5gwqFik2UcpZ7x02YnCml-4gKIhj-mJKX8T0xSl9UUcpmyQf4V2L7rf1P9YXbceLLA</recordid><startdate>20240314</startdate><enddate>20240314</enddate><creator>Emam, Mohamed</creator><creator>Tarek, Ahmed</creator><creator>Soudy, Mohamed</creator><creator>Antunes, Agostinho</creator><creator>Hadidi, Mohamed El</creator><creator>Hamed, Mohamed</creator><general>Springer Vienna</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope><scope>K9.</scope><orcidid>https://orcid.org/0000-0002-1328-1732</orcidid></search><sort><creationdate>20240314</creationdate><title>Comparative evaluation of multiomics integration tools for the study of prediabetes: insights into the earliest stages of type 2 diabetes mellitus</title><author>Emam, Mohamed ; Tarek, Ahmed ; Soudy, Mohamed ; Antunes, Agostinho ; Hadidi, Mohamed El ; Hamed, Mohamed</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c270t-8cb4d0bfff1e3ededdc99d86ecb4070ce7570004280fc033a4c103500ea06d13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Applications of Graph Theory and Complex Networks</topic><topic>Bioinformatics</topic><topic>Biological analysis</topic><topic>Classification</topic><topic>Comparative analysis</topic><topic>Computational Biology/Bioinformatics</topic><topic>Computer Science</topic><topic>Cytokines</topic><topic>Data integration</topic><topic>Datasets</topic><topic>Diabetes</topic><topic>Diabetes mellitus</topic><topic>Diabetes mellitus (non-insulin dependent)</topic><topic>Feature selection</topic><topic>Health Informatics</topic><topic>Integration</topic><topic>Methods</topic><topic>Original Article</topic><topic>Precision medicine</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Emam, Mohamed</creatorcontrib><creatorcontrib>Tarek, Ahmed</creatorcontrib><creatorcontrib>Soudy, Mohamed</creatorcontrib><creatorcontrib>Antunes, Agostinho</creatorcontrib><creatorcontrib>Hadidi, Mohamed El</creatorcontrib><creatorcontrib>Hamed, Mohamed</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><jtitle>Network modeling and analysis in health informatics and bioinformatics (Wien)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Emam, Mohamed</au><au>Tarek, Ahmed</au><au>Soudy, Mohamed</au><au>Antunes, Agostinho</au><au>Hadidi, Mohamed El</au><au>Hamed, Mohamed</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Comparative evaluation of multiomics integration tools for the study of prediabetes: insights into the earliest stages of type 2 diabetes mellitus</atitle><jtitle>Network modeling and analysis in health informatics and bioinformatics (Wien)</jtitle><stitle>Netw Model Anal Health Inform Bioinforma</stitle><date>2024-03-14</date><risdate>2024</risdate><volume>13</volume><issue>1</issue><spage>8</spage><pages>8-</pages><artnum>8</artnum><issn>2192-6670</issn><issn>2192-6662</issn><eissn>2192-6670</eissn><abstract>Type 2 diabetes mellitus (T2D) remains a critical health concern, particularly in its early disease stages such as prediabetes. Understanding these early stages is paramount for improving patient outcomes. Multiomics data integration tools offer promise in unraveling the underlying mechanisms of T2D. The advent of high-throughput technology and the increasing availability of multiomics data has led to the development of several statistical and network-based integration methods. However, the performance of such methods varies, requiring their output evaluation in an unbiased manner. Here, we conducted a comparative analysis of three represented unsupervised multiomics integration tools, MOFA + , GFA, and ICluster alongside an in-house supervised model EMFR, using two complementary benchmarks. First, we assessed how well the features selected by each tool could discriminate between patient and control samples using both linear and nonlinear classification models. Second, we quantified how much each type of omics data-selected features contributed to the total variance. Through such detailed comparisons between the unsupervised, we observed that the features selected by MOFA + and GFA gave the best F1 score (0.7) in the nonlinear classification model, clearly discriminating between patient and control classes. Hence, we recommend these two unsupervised integration tools for feature selection purposes. Our comparative analyses were conducted on a real biological dataset to further study prediabetes patients. Such multiomics data enabled the detection of prediabetes subtypes and provided several clinical insights that will open a new gate toward the era of personalized medicine for diabetic disease.</abstract><cop>Vienna</cop><pub>Springer Vienna</pub><doi>10.1007/s13721-024-00442-9</doi><orcidid>https://orcid.org/0000-0002-1328-1732</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 2192-6670
ispartof Network modeling and analysis in health informatics and bioinformatics (Wien), 2024-03, Vol.13 (1), p.8, Article 8
issn 2192-6670
2192-6662
2192-6670
language eng
recordid cdi_proquest_journals_2956972082
source Springer Link
subjects Applications of Graph Theory and Complex Networks
Bioinformatics
Biological analysis
Classification
Comparative analysis
Computational Biology/Bioinformatics
Computer Science
Cytokines
Data integration
Datasets
Diabetes
Diabetes mellitus
Diabetes mellitus (non-insulin dependent)
Feature selection
Health Informatics
Integration
Methods
Original Article
Precision medicine
title Comparative evaluation of multiomics integration tools for the study of prediabetes: insights into the earliest stages of type 2 diabetes mellitus
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T13%3A46%3A15IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Comparative%20evaluation%20of%20multiomics%20integration%20tools%20for%20the%20study%20of%20prediabetes:%20insights%20into%20the%20earliest%20stages%20of%20type%202%20diabetes%20mellitus&rft.jtitle=Network%20modeling%20and%20analysis%20in%20health%20informatics%20and%20bioinformatics%20(Wien)&rft.au=Emam,%20Mohamed&rft.date=2024-03-14&rft.volume=13&rft.issue=1&rft.spage=8&rft.pages=8-&rft.artnum=8&rft.issn=2192-6670&rft.eissn=2192-6670&rft_id=info:doi/10.1007/s13721-024-00442-9&rft_dat=%3Cproquest_cross%3E2956972082%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c270t-8cb4d0bfff1e3ededdc99d86ecb4070ce7570004280fc033a4c103500ea06d13%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2956972082&rft_id=info:pmid/&rfr_iscdi=true