Loading…

GraphDCA -- a Framework for Node Distribution Comparison in Real and Synthetic Graphs

We argue that when comparing two graphs, the distribution of node structural features is more informative than global graph statistics which are often used in practice, especially to evaluate graph generative models. Thus, we present GraphDCA - a framework for evaluating similarity between graphs ba...

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org 2022-02
Main Authors: Ceylan, Ciwan, Poklukar, Petra, Hultin, Hanna, Kravchenko, Alexander, Varava, Anastasia, Kragic, Danica
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Ceylan, Ciwan
Poklukar, Petra
Hultin, Hanna
Kravchenko, Alexander
Varava, Anastasia
Kragic, Danica
description We argue that when comparing two graphs, the distribution of node structural features is more informative than global graph statistics which are often used in practice, especially to evaluate graph generative models. Thus, we present GraphDCA - a framework for evaluating similarity between graphs based on the alignment of their respective node representation sets. The sets are compared using a recently proposed method for comparing representation spaces, called Delaunay Component Analysis (DCA), which we extend to graph data. To evaluate our framework, we generate a benchmark dataset of graphs exhibiting different structural patterns and show, using three node structure feature extractors, that GraphDCA recognizes graphs with both similar and dissimilar local structure. We then apply our framework to evaluate three publicly available real-world graph datasets and demonstrate, using gradual edge perturbations, that GraphDCA satisfyingly captures gradually decreasing similarity, unlike global statistics. Finally, we use GraphDCA to evaluate two state-of-the-art graph generative models, NetGAN and CELL, and conclude that further improvements are needed for these models to adequately reproduce local structural features.
format article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2627264822</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2627264822</sourcerecordid><originalsourceid>FETCH-proquest_journals_26272648223</originalsourceid><addsrcrecordid>eNqNjMsKwjAUBYMgWLT_cMF1oN70tZXW6sqFj3WJNqWpbVKTFPHvLeIHuDoDM5wZ8ZCxDU1DxAXxrW2DIMA4wShiHrnuDR-aPNsCpcChMLwXL20eUGsDR10JyKV1Rt5GJ7WCTPcDN9JOKBWcBO-AqwrOb-Ua4eQdvnd2ReY176zwf7sk62J3yQ50MPo5CuvKVo9GTarEGBOMwxSR_Vd9ANwDP2U</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2627264822</pqid></control><display><type>article</type><title>GraphDCA -- a Framework for Node Distribution Comparison in Real and Synthetic Graphs</title><source>Publicly Available Content Database</source><creator>Ceylan, Ciwan ; Poklukar, Petra ; Hultin, Hanna ; Kravchenko, Alexander ; Varava, Anastasia ; Kragic, Danica</creator><creatorcontrib>Ceylan, Ciwan ; Poklukar, Petra ; Hultin, Hanna ; Kravchenko, Alexander ; Varava, Anastasia ; Kragic, Danica</creatorcontrib><description>We argue that when comparing two graphs, the distribution of node structural features is more informative than global graph statistics which are often used in practice, especially to evaluate graph generative models. Thus, we present GraphDCA - a framework for evaluating similarity between graphs based on the alignment of their respective node representation sets. The sets are compared using a recently proposed method for comparing representation spaces, called Delaunay Component Analysis (DCA), which we extend to graph data. To evaluate our framework, we generate a benchmark dataset of graphs exhibiting different structural patterns and show, using three node structure feature extractors, that GraphDCA recognizes graphs with both similar and dissimilar local structure. We then apply our framework to evaluate three publicly available real-world graph datasets and demonstrate, using gradual edge perturbations, that GraphDCA satisfyingly captures gradually decreasing similarity, unlike global statistics. Finally, we use GraphDCA to evaluate two state-of-the-art graph generative models, NetGAN and CELL, and conclude that further improvements are needed for these models to adequately reproduce local structural features.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Datasets ; Feature extraction ; Feature recognition ; Graphs ; Nodes ; Perturbation ; Representations ; Similarity ; State-of-the-art reviews</subject><ispartof>arXiv.org, 2022-02</ispartof><rights>2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2627264822?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>776,780,25731,36989,44566</link.rule.ids></links><search><creatorcontrib>Ceylan, Ciwan</creatorcontrib><creatorcontrib>Poklukar, Petra</creatorcontrib><creatorcontrib>Hultin, Hanna</creatorcontrib><creatorcontrib>Kravchenko, Alexander</creatorcontrib><creatorcontrib>Varava, Anastasia</creatorcontrib><creatorcontrib>Kragic, Danica</creatorcontrib><title>GraphDCA -- a Framework for Node Distribution Comparison in Real and Synthetic Graphs</title><title>arXiv.org</title><description>We argue that when comparing two graphs, the distribution of node structural features is more informative than global graph statistics which are often used in practice, especially to evaluate graph generative models. Thus, we present GraphDCA - a framework for evaluating similarity between graphs based on the alignment of their respective node representation sets. The sets are compared using a recently proposed method for comparing representation spaces, called Delaunay Component Analysis (DCA), which we extend to graph data. To evaluate our framework, we generate a benchmark dataset of graphs exhibiting different structural patterns and show, using three node structure feature extractors, that GraphDCA recognizes graphs with both similar and dissimilar local structure. We then apply our framework to evaluate three publicly available real-world graph datasets and demonstrate, using gradual edge perturbations, that GraphDCA satisfyingly captures gradually decreasing similarity, unlike global statistics. Finally, we use GraphDCA to evaluate two state-of-the-art graph generative models, NetGAN and CELL, and conclude that further improvements are needed for these models to adequately reproduce local structural features.</description><subject>Datasets</subject><subject>Feature extraction</subject><subject>Feature recognition</subject><subject>Graphs</subject><subject>Nodes</subject><subject>Perturbation</subject><subject>Representations</subject><subject>Similarity</subject><subject>State-of-the-art reviews</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNqNjMsKwjAUBYMgWLT_cMF1oN70tZXW6sqFj3WJNqWpbVKTFPHvLeIHuDoDM5wZ8ZCxDU1DxAXxrW2DIMA4wShiHrnuDR-aPNsCpcChMLwXL20eUGsDR10JyKV1Rt5GJ7WCTPcDN9JOKBWcBO-AqwrOb-Ua4eQdvnd2ReY176zwf7sk62J3yQ50MPo5CuvKVo9GTarEGBOMwxSR_Vd9ANwDP2U</recordid><startdate>20220209</startdate><enddate>20220209</enddate><creator>Ceylan, Ciwan</creator><creator>Poklukar, Petra</creator><creator>Hultin, Hanna</creator><creator>Kravchenko, Alexander</creator><creator>Varava, Anastasia</creator><creator>Kragic, Danica</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20220209</creationdate><title>GraphDCA -- a Framework for Node Distribution Comparison in Real and Synthetic Graphs</title><author>Ceylan, Ciwan ; Poklukar, Petra ; Hultin, Hanna ; Kravchenko, Alexander ; Varava, Anastasia ; Kragic, Danica</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_26272648223</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Datasets</topic><topic>Feature extraction</topic><topic>Feature recognition</topic><topic>Graphs</topic><topic>Nodes</topic><topic>Perturbation</topic><topic>Representations</topic><topic>Similarity</topic><topic>State-of-the-art reviews</topic><toplevel>online_resources</toplevel><creatorcontrib>Ceylan, Ciwan</creatorcontrib><creatorcontrib>Poklukar, Petra</creatorcontrib><creatorcontrib>Hultin, Hanna</creatorcontrib><creatorcontrib>Kravchenko, Alexander</creatorcontrib><creatorcontrib>Varava, Anastasia</creatorcontrib><creatorcontrib>Kragic, Danica</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ceylan, Ciwan</au><au>Poklukar, Petra</au><au>Hultin, Hanna</au><au>Kravchenko, Alexander</au><au>Varava, Anastasia</au><au>Kragic, Danica</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>GraphDCA -- a Framework for Node Distribution Comparison in Real and Synthetic Graphs</atitle><jtitle>arXiv.org</jtitle><date>2022-02-09</date><risdate>2022</risdate><eissn>2331-8422</eissn><abstract>We argue that when comparing two graphs, the distribution of node structural features is more informative than global graph statistics which are often used in practice, especially to evaluate graph generative models. Thus, we present GraphDCA - a framework for evaluating similarity between graphs based on the alignment of their respective node representation sets. The sets are compared using a recently proposed method for comparing representation spaces, called Delaunay Component Analysis (DCA), which we extend to graph data. To evaluate our framework, we generate a benchmark dataset of graphs exhibiting different structural patterns and show, using three node structure feature extractors, that GraphDCA recognizes graphs with both similar and dissimilar local structure. We then apply our framework to evaluate three publicly available real-world graph datasets and demonstrate, using gradual edge perturbations, that GraphDCA satisfyingly captures gradually decreasing similarity, unlike global statistics. Finally, we use GraphDCA to evaluate two state-of-the-art graph generative models, NetGAN and CELL, and conclude that further improvements are needed for these models to adequately reproduce local structural features.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2022-02
issn 2331-8422
language eng
recordid cdi_proquest_journals_2627264822
source Publicly Available Content Database
subjects Datasets
Feature extraction
Feature recognition
Graphs
Nodes
Perturbation
Representations
Similarity
State-of-the-art reviews
title GraphDCA -- a Framework for Node Distribution Comparison in Real and Synthetic Graphs
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T22%3A18%3A08IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=GraphDCA%20--%20a%20Framework%20for%20Node%20Distribution%20Comparison%20in%20Real%20and%20Synthetic%20Graphs&rft.jtitle=arXiv.org&rft.au=Ceylan,%20Ciwan&rft.date=2022-02-09&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2627264822%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_26272648223%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2627264822&rft_id=info:pmid/&rfr_iscdi=true