Loading…
GraphDCA -- a Framework for Node Distribution Comparison in Real and Synthetic Graphs
We argue that when comparing two graphs, the distribution of node structural features is more informative than global graph statistics which are often used in practice, especially to evaluate graph generative models. Thus, we present GraphDCA - a framework for evaluating similarity between graphs ba...
Saved in:
Published in: | arXiv.org 2022-02 |
---|---|
Main Authors: | , , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | |
---|---|
cites | |
container_end_page | |
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Ceylan, Ciwan Poklukar, Petra Hultin, Hanna Kravchenko, Alexander Varava, Anastasia Kragic, Danica |
description | We argue that when comparing two graphs, the distribution of node structural features is more informative than global graph statistics which are often used in practice, especially to evaluate graph generative models. Thus, we present GraphDCA - a framework for evaluating similarity between graphs based on the alignment of their respective node representation sets. The sets are compared using a recently proposed method for comparing representation spaces, called Delaunay Component Analysis (DCA), which we extend to graph data. To evaluate our framework, we generate a benchmark dataset of graphs exhibiting different structural patterns and show, using three node structure feature extractors, that GraphDCA recognizes graphs with both similar and dissimilar local structure. We then apply our framework to evaluate three publicly available real-world graph datasets and demonstrate, using gradual edge perturbations, that GraphDCA satisfyingly captures gradually decreasing similarity, unlike global statistics. Finally, we use GraphDCA to evaluate two state-of-the-art graph generative models, NetGAN and CELL, and conclude that further improvements are needed for these models to adequately reproduce local structural features. |
format | article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2627264822</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2627264822</sourcerecordid><originalsourceid>FETCH-proquest_journals_26272648223</originalsourceid><addsrcrecordid>eNqNjMsKwjAUBYMgWLT_cMF1oN70tZXW6sqFj3WJNqWpbVKTFPHvLeIHuDoDM5wZ8ZCxDU1DxAXxrW2DIMA4wShiHrnuDR-aPNsCpcChMLwXL20eUGsDR10JyKV1Rt5GJ7WCTPcDN9JOKBWcBO-AqwrOb-Ua4eQdvnd2ReY176zwf7sk62J3yQ50MPo5CuvKVo9GTarEGBOMwxSR_Vd9ANwDP2U</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2627264822</pqid></control><display><type>article</type><title>GraphDCA -- a Framework for Node Distribution Comparison in Real and Synthetic Graphs</title><source>Publicly Available Content Database</source><creator>Ceylan, Ciwan ; Poklukar, Petra ; Hultin, Hanna ; Kravchenko, Alexander ; Varava, Anastasia ; Kragic, Danica</creator><creatorcontrib>Ceylan, Ciwan ; Poklukar, Petra ; Hultin, Hanna ; Kravchenko, Alexander ; Varava, Anastasia ; Kragic, Danica</creatorcontrib><description>We argue that when comparing two graphs, the distribution of node structural features is more informative than global graph statistics which are often used in practice, especially to evaluate graph generative models. Thus, we present GraphDCA - a framework for evaluating similarity between graphs based on the alignment of their respective node representation sets. The sets are compared using a recently proposed method for comparing representation spaces, called Delaunay Component Analysis (DCA), which we extend to graph data. To evaluate our framework, we generate a benchmark dataset of graphs exhibiting different structural patterns and show, using three node structure feature extractors, that GraphDCA recognizes graphs with both similar and dissimilar local structure. We then apply our framework to evaluate three publicly available real-world graph datasets and demonstrate, using gradual edge perturbations, that GraphDCA satisfyingly captures gradually decreasing similarity, unlike global statistics. Finally, we use GraphDCA to evaluate two state-of-the-art graph generative models, NetGAN and CELL, and conclude that further improvements are needed for these models to adequately reproduce local structural features.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Datasets ; Feature extraction ; Feature recognition ; Graphs ; Nodes ; Perturbation ; Representations ; Similarity ; State-of-the-art reviews</subject><ispartof>arXiv.org, 2022-02</ispartof><rights>2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2627264822?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>776,780,25731,36989,44566</link.rule.ids></links><search><creatorcontrib>Ceylan, Ciwan</creatorcontrib><creatorcontrib>Poklukar, Petra</creatorcontrib><creatorcontrib>Hultin, Hanna</creatorcontrib><creatorcontrib>Kravchenko, Alexander</creatorcontrib><creatorcontrib>Varava, Anastasia</creatorcontrib><creatorcontrib>Kragic, Danica</creatorcontrib><title>GraphDCA -- a Framework for Node Distribution Comparison in Real and Synthetic Graphs</title><title>arXiv.org</title><description>We argue that when comparing two graphs, the distribution of node structural features is more informative than global graph statistics which are often used in practice, especially to evaluate graph generative models. Thus, we present GraphDCA - a framework for evaluating similarity between graphs based on the alignment of their respective node representation sets. The sets are compared using a recently proposed method for comparing representation spaces, called Delaunay Component Analysis (DCA), which we extend to graph data. To evaluate our framework, we generate a benchmark dataset of graphs exhibiting different structural patterns and show, using three node structure feature extractors, that GraphDCA recognizes graphs with both similar and dissimilar local structure. We then apply our framework to evaluate three publicly available real-world graph datasets and demonstrate, using gradual edge perturbations, that GraphDCA satisfyingly captures gradually decreasing similarity, unlike global statistics. Finally, we use GraphDCA to evaluate two state-of-the-art graph generative models, NetGAN and CELL, and conclude that further improvements are needed for these models to adequately reproduce local structural features.</description><subject>Datasets</subject><subject>Feature extraction</subject><subject>Feature recognition</subject><subject>Graphs</subject><subject>Nodes</subject><subject>Perturbation</subject><subject>Representations</subject><subject>Similarity</subject><subject>State-of-the-art reviews</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNjMsKwjAUBYMgWLT_cMF1oN70tZXW6sqFj3WJNqWpbVKTFPHvLeIHuDoDM5wZ8ZCxDU1DxAXxrW2DIMA4wShiHrnuDR-aPNsCpcChMLwXL20eUGsDR10JyKV1Rt5GJ7WCTPcDN9JOKBWcBO-AqwrOb-Ua4eQdvnd2ReY176zwf7sk62J3yQ50MPo5CuvKVo9GTarEGBOMwxSR_Vd9ANwDP2U</recordid><startdate>20220209</startdate><enddate>20220209</enddate><creator>Ceylan, Ciwan</creator><creator>Poklukar, Petra</creator><creator>Hultin, Hanna</creator><creator>Kravchenko, Alexander</creator><creator>Varava, Anastasia</creator><creator>Kragic, Danica</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20220209</creationdate><title>GraphDCA -- a Framework for Node Distribution Comparison in Real and Synthetic Graphs</title><author>Ceylan, Ciwan ; Poklukar, Petra ; Hultin, Hanna ; Kravchenko, Alexander ; Varava, Anastasia ; Kragic, Danica</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_26272648223</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Datasets</topic><topic>Feature extraction</topic><topic>Feature recognition</topic><topic>Graphs</topic><topic>Nodes</topic><topic>Perturbation</topic><topic>Representations</topic><topic>Similarity</topic><topic>State-of-the-art reviews</topic><toplevel>online_resources</toplevel><creatorcontrib>Ceylan, Ciwan</creatorcontrib><creatorcontrib>Poklukar, Petra</creatorcontrib><creatorcontrib>Hultin, Hanna</creatorcontrib><creatorcontrib>Kravchenko, Alexander</creatorcontrib><creatorcontrib>Varava, Anastasia</creatorcontrib><creatorcontrib>Kragic, Danica</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ceylan, Ciwan</au><au>Poklukar, Petra</au><au>Hultin, Hanna</au><au>Kravchenko, Alexander</au><au>Varava, Anastasia</au><au>Kragic, Danica</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>GraphDCA -- a Framework for Node Distribution Comparison in Real and Synthetic Graphs</atitle><jtitle>arXiv.org</jtitle><date>2022-02-09</date><risdate>2022</risdate><eissn>2331-8422</eissn><abstract>We argue that when comparing two graphs, the distribution of node structural features is more informative than global graph statistics which are often used in practice, especially to evaluate graph generative models. Thus, we present GraphDCA - a framework for evaluating similarity between graphs based on the alignment of their respective node representation sets. The sets are compared using a recently proposed method for comparing representation spaces, called Delaunay Component Analysis (DCA), which we extend to graph data. To evaluate our framework, we generate a benchmark dataset of graphs exhibiting different structural patterns and show, using three node structure feature extractors, that GraphDCA recognizes graphs with both similar and dissimilar local structure. We then apply our framework to evaluate three publicly available real-world graph datasets and demonstrate, using gradual edge perturbations, that GraphDCA satisfyingly captures gradually decreasing similarity, unlike global statistics. Finally, we use GraphDCA to evaluate two state-of-the-art graph generative models, NetGAN and CELL, and conclude that further improvements are needed for these models to adequately reproduce local structural features.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2022-02 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2627264822 |
source | Publicly Available Content Database |
subjects | Datasets Feature extraction Feature recognition Graphs Nodes Perturbation Representations Similarity State-of-the-art reviews |
title | GraphDCA -- a Framework for Node Distribution Comparison in Real and Synthetic Graphs |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T22%3A18%3A08IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=GraphDCA%20--%20a%20Framework%20for%20Node%20Distribution%20Comparison%20in%20Real%20and%20Synthetic%20Graphs&rft.jtitle=arXiv.org&rft.au=Ceylan,%20Ciwan&rft.date=2022-02-09&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2627264822%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_26272648223%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2627264822&rft_id=info:pmid/&rfr_iscdi=true |