Loading…

Parenclitic and Synolytic Networks Revisited

Parenclitic networks provide a powerful and relatively new way to coerce multidimensional data into a graph form, enabling the application of graph theory to evaluate features. Different algorithms have been published for constructing parenclitic networks, leading to the question-which algorithm sho...

Full description

Saved in:
Bibliographic Details
Published in:Frontiers in genetics 2021-10, Vol.12, p.733783-733783
Main Authors: Nazarenko, Tatiana, Whitwell, Harry J, Blyuss, Oleg, Zaikin, Alexey
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites cdi_FETCH-LOGICAL-c417t-fe72ce677cc6603970503b0be0b27bcb2402a57fa755b11d975d78c9090cfae23
container_end_page 733783
container_issue
container_start_page 733783
container_title Frontiers in genetics
container_volume 12
creator Nazarenko, Tatiana
Whitwell, Harry J
Blyuss, Oleg
Zaikin, Alexey
description Parenclitic networks provide a powerful and relatively new way to coerce multidimensional data into a graph form, enabling the application of graph theory to evaluate features. Different algorithms have been published for constructing parenclitic networks, leading to the question-which algorithm should be chosen? Initially, it was suggested to calculate the weight of an edge between two nodes of the network as a deviation from a linear regression, calculated for a dependence of one of these features on the other. This method works well, but not when features do not have a linear relationship. To overcome this, it was suggested to calculate edge weights as the distance from the area of most probable values by using a kernel density estimation. In these two approaches only one class (typically controls or healthy population) is used to construct a model. To take account of a second class, we have introduced synolytic networks, using a boundary between two classes on the feature-feature plane to estimate the weight of the edge between these features. Common to all these approaches is that topological indices can be used to evaluate the structure represented by the graphs. To compare these network approaches alongside more traditional machine-learning algorithms, we performed a substantial analysis using both synthetic data with known structure and publicly available datasets used for the benchmarking of ML-algorithms. Such a comparison has shown that the main advantage of parenclitic and synolytic networks is their resistance to over-fitting (occurring when the number of features is greater than the number of subjects) compared to other ML approaches. Secondly, the capability to visualise data in a structured form, even when this structure is not available allows for visual inspection and the application of well-established graph theory to their interpretation/application, eliminating the "black-box" nature of other ML approaches.
doi_str_mv 10.3389/fgene.2021.733783
format article
fullrecord <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_doaj_primary_oai_doaj_org_article_ec9d7cbb8a034b1d9972ecfdb59090a5</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_ec9d7cbb8a034b1d9972ecfdb59090a5</doaj_id><sourcerecordid>2595103216</sourcerecordid><originalsourceid>FETCH-LOGICAL-c417t-fe72ce677cc6603970503b0be0b27bcb2402a57fa755b11d975d78c9090cfae23</originalsourceid><addsrcrecordid>eNpVkclOwzAQQC0EoqjwAVxQjxxI8RrHFyRUsUkIEMvZsp1JMaRxsdOi_j0JBVR8GXuW55EeQocEjxkr1Gk1hQbGFFMylozJgm2hPZLnPCu61PbGfYAOUnrD3eGKMcZ30YBxyQUldA-dPJgIjat9693INOXoadWEetW_7qD9DPE9jR5h6ZNvodxHO5WpExz8xCF6ubx4nlxnt_dXN5Pz28xxItusAkkd5FI6l-eYKYkFZhZbwJZK6yzlmBohKyOFsISUSopSFk5hhV1lgLIhullzy2De9Dz6mYkrHYzX34kQp9rEbsUaNDhVSmdtYTDjtmMpScFVpRU9zoiOdbZmzRd2BqWDpo2m_gf9X2n8q56GpS5EzjHvAcc_gBg-FpBaPfPJQV2bBsIiaSqUIJhRknetZN3qYkgpQvX3DcG6l6a_peleml5L62aONvf7m_hVxL4AoWmTsA</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2595103216</pqid></control><display><type>article</type><title>Parenclitic and Synolytic Networks Revisited</title><source>PubMed Central</source><creator>Nazarenko, Tatiana ; Whitwell, Harry J ; Blyuss, Oleg ; Zaikin, Alexey</creator><creatorcontrib>Nazarenko, Tatiana ; Whitwell, Harry J ; Blyuss, Oleg ; Zaikin, Alexey</creatorcontrib><description>Parenclitic networks provide a powerful and relatively new way to coerce multidimensional data into a graph form, enabling the application of graph theory to evaluate features. Different algorithms have been published for constructing parenclitic networks, leading to the question-which algorithm should be chosen? Initially, it was suggested to calculate the weight of an edge between two nodes of the network as a deviation from a linear regression, calculated for a dependence of one of these features on the other. This method works well, but not when features do not have a linear relationship. To overcome this, it was suggested to calculate edge weights as the distance from the area of most probable values by using a kernel density estimation. In these two approaches only one class (typically controls or healthy population) is used to construct a model. To take account of a second class, we have introduced synolytic networks, using a boundary between two classes on the feature-feature plane to estimate the weight of the edge between these features. Common to all these approaches is that topological indices can be used to evaluate the structure represented by the graphs. To compare these network approaches alongside more traditional machine-learning algorithms, we performed a substantial analysis using both synthetic data with known structure and publicly available datasets used for the benchmarking of ML-algorithms. Such a comparison has shown that the main advantage of parenclitic and synolytic networks is their resistance to over-fitting (occurring when the number of features is greater than the number of subjects) compared to other ML approaches. Secondly, the capability to visualise data in a structured form, even when this structure is not available allows for visual inspection and the application of well-established graph theory to their interpretation/application, eliminating the "black-box" nature of other ML approaches.</description><identifier>ISSN: 1664-8021</identifier><identifier>EISSN: 1664-8021</identifier><identifier>DOI: 10.3389/fgene.2021.733783</identifier><identifier>PMID: 34745212</identifier><language>eng</language><publisher>Switzerland: Frontiers Media S.A</publisher><subject>complexity ; Genetics ; graphs ; networks ; parenclitic ; synolytic</subject><ispartof>Frontiers in genetics, 2021-10, Vol.12, p.733783-733783</ispartof><rights>Copyright © 2021 Nazarenko, Whitwell, Blyuss and Zaikin.</rights><rights>Copyright © 2021 Nazarenko, Whitwell, Blyuss and Zaikin. 2021 Nazarenko, Whitwell, Blyuss and Zaikin</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c417t-fe72ce677cc6603970503b0be0b27bcb2402a57fa755b11d975d78c9090cfae23</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8564045/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8564045/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,27924,27925,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/34745212$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Nazarenko, Tatiana</creatorcontrib><creatorcontrib>Whitwell, Harry J</creatorcontrib><creatorcontrib>Blyuss, Oleg</creatorcontrib><creatorcontrib>Zaikin, Alexey</creatorcontrib><title>Parenclitic and Synolytic Networks Revisited</title><title>Frontiers in genetics</title><addtitle>Front Genet</addtitle><description>Parenclitic networks provide a powerful and relatively new way to coerce multidimensional data into a graph form, enabling the application of graph theory to evaluate features. Different algorithms have been published for constructing parenclitic networks, leading to the question-which algorithm should be chosen? Initially, it was suggested to calculate the weight of an edge between two nodes of the network as a deviation from a linear regression, calculated for a dependence of one of these features on the other. This method works well, but not when features do not have a linear relationship. To overcome this, it was suggested to calculate edge weights as the distance from the area of most probable values by using a kernel density estimation. In these two approaches only one class (typically controls or healthy population) is used to construct a model. To take account of a second class, we have introduced synolytic networks, using a boundary between two classes on the feature-feature plane to estimate the weight of the edge between these features. Common to all these approaches is that topological indices can be used to evaluate the structure represented by the graphs. To compare these network approaches alongside more traditional machine-learning algorithms, we performed a substantial analysis using both synthetic data with known structure and publicly available datasets used for the benchmarking of ML-algorithms. Such a comparison has shown that the main advantage of parenclitic and synolytic networks is their resistance to over-fitting (occurring when the number of features is greater than the number of subjects) compared to other ML approaches. Secondly, the capability to visualise data in a structured form, even when this structure is not available allows for visual inspection and the application of well-established graph theory to their interpretation/application, eliminating the "black-box" nature of other ML approaches.</description><subject>complexity</subject><subject>Genetics</subject><subject>graphs</subject><subject>networks</subject><subject>parenclitic</subject><subject>synolytic</subject><issn>1664-8021</issn><issn>1664-8021</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>DOA</sourceid><recordid>eNpVkclOwzAQQC0EoqjwAVxQjxxI8RrHFyRUsUkIEMvZsp1JMaRxsdOi_j0JBVR8GXuW55EeQocEjxkr1Gk1hQbGFFMylozJgm2hPZLnPCu61PbGfYAOUnrD3eGKMcZ30YBxyQUldA-dPJgIjat9693INOXoadWEetW_7qD9DPE9jR5h6ZNvodxHO5WpExz8xCF6ubx4nlxnt_dXN5Pz28xxItusAkkd5FI6l-eYKYkFZhZbwJZK6yzlmBohKyOFsISUSopSFk5hhV1lgLIhullzy2De9Dz6mYkrHYzX34kQp9rEbsUaNDhVSmdtYTDjtmMpScFVpRU9zoiOdbZmzRd2BqWDpo2m_gf9X2n8q56GpS5EzjHvAcc_gBg-FpBaPfPJQV2bBsIiaSqUIJhRknetZN3qYkgpQvX3DcG6l6a_peleml5L62aONvf7m_hVxL4AoWmTsA</recordid><startdate>20211020</startdate><enddate>20211020</enddate><creator>Nazarenko, Tatiana</creator><creator>Whitwell, Harry J</creator><creator>Blyuss, Oleg</creator><creator>Zaikin, Alexey</creator><general>Frontiers Media S.A</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope></search><sort><creationdate>20211020</creationdate><title>Parenclitic and Synolytic Networks Revisited</title><author>Nazarenko, Tatiana ; Whitwell, Harry J ; Blyuss, Oleg ; Zaikin, Alexey</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c417t-fe72ce677cc6603970503b0be0b27bcb2402a57fa755b11d975d78c9090cfae23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>complexity</topic><topic>Genetics</topic><topic>graphs</topic><topic>networks</topic><topic>parenclitic</topic><topic>synolytic</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Nazarenko, Tatiana</creatorcontrib><creatorcontrib>Whitwell, Harry J</creatorcontrib><creatorcontrib>Blyuss, Oleg</creatorcontrib><creatorcontrib>Zaikin, Alexey</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>Frontiers in genetics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Nazarenko, Tatiana</au><au>Whitwell, Harry J</au><au>Blyuss, Oleg</au><au>Zaikin, Alexey</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Parenclitic and Synolytic Networks Revisited</atitle><jtitle>Frontiers in genetics</jtitle><addtitle>Front Genet</addtitle><date>2021-10-20</date><risdate>2021</risdate><volume>12</volume><spage>733783</spage><epage>733783</epage><pages>733783-733783</pages><issn>1664-8021</issn><eissn>1664-8021</eissn><abstract>Parenclitic networks provide a powerful and relatively new way to coerce multidimensional data into a graph form, enabling the application of graph theory to evaluate features. Different algorithms have been published for constructing parenclitic networks, leading to the question-which algorithm should be chosen? Initially, it was suggested to calculate the weight of an edge between two nodes of the network as a deviation from a linear regression, calculated for a dependence of one of these features on the other. This method works well, but not when features do not have a linear relationship. To overcome this, it was suggested to calculate edge weights as the distance from the area of most probable values by using a kernel density estimation. In these two approaches only one class (typically controls or healthy population) is used to construct a model. To take account of a second class, we have introduced synolytic networks, using a boundary between two classes on the feature-feature plane to estimate the weight of the edge between these features. Common to all these approaches is that topological indices can be used to evaluate the structure represented by the graphs. To compare these network approaches alongside more traditional machine-learning algorithms, we performed a substantial analysis using both synthetic data with known structure and publicly available datasets used for the benchmarking of ML-algorithms. Such a comparison has shown that the main advantage of parenclitic and synolytic networks is their resistance to over-fitting (occurring when the number of features is greater than the number of subjects) compared to other ML approaches. Secondly, the capability to visualise data in a structured form, even when this structure is not available allows for visual inspection and the application of well-established graph theory to their interpretation/application, eliminating the "black-box" nature of other ML approaches.</abstract><cop>Switzerland</cop><pub>Frontiers Media S.A</pub><pmid>34745212</pmid><doi>10.3389/fgene.2021.733783</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1664-8021
ispartof Frontiers in genetics, 2021-10, Vol.12, p.733783-733783
issn 1664-8021
1664-8021
language eng
recordid cdi_doaj_primary_oai_doaj_org_article_ec9d7cbb8a034b1d9972ecfdb59090a5
source PubMed Central
subjects complexity
Genetics
graphs
networks
parenclitic
synolytic
title Parenclitic and Synolytic Networks Revisited
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T13%3A21%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Parenclitic%20and%20Synolytic%20Networks%20Revisited&rft.jtitle=Frontiers%20in%20genetics&rft.au=Nazarenko,%20Tatiana&rft.date=2021-10-20&rft.volume=12&rft.spage=733783&rft.epage=733783&rft.pages=733783-733783&rft.issn=1664-8021&rft.eissn=1664-8021&rft_id=info:doi/10.3389/fgene.2021.733783&rft_dat=%3Cproquest_doaj_%3E2595103216%3C/proquest_doaj_%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c417t-fe72ce677cc6603970503b0be0b27bcb2402a57fa755b11d975d78c9090cfae23%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2595103216&rft_id=info:pmid/34745212&rfr_iscdi=true