Loading…

Verification-Free Approaches to Efficient Locally Densest Subgraph Discovery

Finding dense subgraphs from a large graph is a fundamental graph mining task with many applications. The notion is recently formulated of locally densest subgraph (LDS) is recently formulated to identify multiple dense subgraphs that cover different regions of a large graph. Informally, an LDS is a...

Full description

Saved in:
Bibliographic Details
Main Authors: Trung, Tran Ba, Chang, Lijun, Long, Nguyen Tien, Yao, Kai, Thanh Binh, Huynh Thi
Format: Conference Proceeding
Language:English
Subjects:
Online Access:Request full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page 13
container_issue
container_start_page 1
container_title
container_volume
creator Trung, Tran Ba
Chang, Lijun
Long, Nguyen Tien
Yao, Kai
Thanh Binh, Huynh Thi
description Finding dense subgraphs from a large graph is a fundamental graph mining task with many applications. The notion is recently formulated of locally densest subgraph (LDS) is recently formulated to identify multiple dense subgraphs that cover different regions of a large graph. Informally, an LDS is a subgraph with the highest density in its local region. The state-of-the-art algorithm for computing top-k LDSes with the highest densities is LDS. It iteratively computes the densest subgraph and removes it from the graph, where all the computed densest subgraphs form the candidates of LDSes. Then, each candidate is verified through a costly maximum flow computation. Although advanced pruning techniques are proposed in LDS, the verification step is still time consuming especially for not-so-small k values. In this paper, we aim to improve the efficiency of finding top-k LDSes by designing verification-free approaches. Our algorithms are based on our observation that the set of maximal λ-compact subgraphs for all possible λ values form a hierarchical structure, and LDSes are simply leaves in the hierarchical structure. Thus, we propose a divide-and-conquer algorithm LDS-DC as well as an optimized algorithm LDS-Opt to efficiently identify top-k LDSes without constructing the entire hierarchical structure. Both of our algorithms have lower time complexities than LDS. Extensive empirical studies on real graphs show that our optimized algorithm LDS-Opt outperforms LDS for all k values, and the improvement is up-to several orders of magnitude.
doi_str_mv 10.1109/ICDE55515.2023.00008
format conference_proceeding
fullrecord <record><control><sourceid>ieee_CHZPO</sourceid><recordid>TN_cdi_ieee_primary_10184510</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10184510</ieee_id><sourcerecordid>10184510</sourcerecordid><originalsourceid>FETCH-LOGICAL-i91t-ab6c9a08b9f5354fdb988e5030593ef012e83c169286a69a3986761b790f302f3</originalsourceid><addsrcrecordid>eNotjs1KAzEUhaMgWGrfoIu8wNR7kyaTuyz90cKAC4u4K5nxxkbqzJCMwry9FT2bs_gOH0eIOcICEeh-v95sjTFoFgqUXsAl7krMqCSnDWilVEnXYqJ0aQpQ9vVWzHL--J3REtHARFQvnGKIjR9i1xa7xCxXfZ8635w4y6GT23ChkdtBVl3jz-dRbrjNnAf5_FW_J9-f5CbmpvvmNN6Jm-DPmWf_PRWH3fawfiyqp4f9elUVkXAofG0b8uBqCkabZXiryTm-3AVDmgOgYqcbtKSc9Za8JmdLi3VJEDSooKdi_qeNzHzsU_z0aTwioFsaBP0DSJZOUQ</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Verification-Free Approaches to Efficient Locally Densest Subgraph Discovery</title><source>IEEE Xplore All Conference Series</source><creator>Trung, Tran Ba ; Chang, Lijun ; Long, Nguyen Tien ; Yao, Kai ; Thanh Binh, Huynh Thi</creator><creatorcontrib>Trung, Tran Ba ; Chang, Lijun ; Long, Nguyen Tien ; Yao, Kai ; Thanh Binh, Huynh Thi</creatorcontrib><description>Finding dense subgraphs from a large graph is a fundamental graph mining task with many applications. The notion is recently formulated of locally densest subgraph (LDS) is recently formulated to identify multiple dense subgraphs that cover different regions of a large graph. Informally, an LDS is a subgraph with the highest density in its local region. The state-of-the-art algorithm for computing top-k LDSes with the highest densities is LDS. It iteratively computes the densest subgraph and removes it from the graph, where all the computed densest subgraphs form the candidates of LDSes. Then, each candidate is verified through a costly maximum flow computation. Although advanced pruning techniques are proposed in LDS, the verification step is still time consuming especially for not-so-small k values. In this paper, we aim to improve the efficiency of finding top-k LDSes by designing verification-free approaches. Our algorithms are based on our observation that the set of maximal λ-compact subgraphs for all possible λ values form a hierarchical structure, and LDSes are simply leaves in the hierarchical structure. Thus, we propose a divide-and-conquer algorithm LDS-DC as well as an optimized algorithm LDS-Opt to efficiently identify top-k LDSes without constructing the entire hierarchical structure. Both of our algorithms have lower time complexities than LDS. Extensive empirical studies on real graphs show that our optimized algorithm LDS-Opt outperforms LDS for all k values, and the improvement is up-to several orders of magnitude.</description><identifier>EISSN: 2375-026X</identifier><identifier>EISBN: 9798350322279</identifier><identifier>DOI: 10.1109/ICDE55515.2023.00008</identifier><identifier>CODEN: IEEPAD</identifier><language>eng</language><publisher>IEEE</publisher><subject>Data engineering ; Locally Dense Subgraphs ; Locally Densest Subgraphs ; Maximal λ-compact Subgraphs ; Programming ; Task analysis ; Time complexity</subject><ispartof>2023 IEEE 39th International Conference on Data Engineering (ICDE), 2023, p.1-13</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10184510$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,27925,54555,54932</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10184510$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Trung, Tran Ba</creatorcontrib><creatorcontrib>Chang, Lijun</creatorcontrib><creatorcontrib>Long, Nguyen Tien</creatorcontrib><creatorcontrib>Yao, Kai</creatorcontrib><creatorcontrib>Thanh Binh, Huynh Thi</creatorcontrib><title>Verification-Free Approaches to Efficient Locally Densest Subgraph Discovery</title><title>2023 IEEE 39th International Conference on Data Engineering (ICDE)</title><addtitle>ICDE</addtitle><description>Finding dense subgraphs from a large graph is a fundamental graph mining task with many applications. The notion is recently formulated of locally densest subgraph (LDS) is recently formulated to identify multiple dense subgraphs that cover different regions of a large graph. Informally, an LDS is a subgraph with the highest density in its local region. The state-of-the-art algorithm for computing top-k LDSes with the highest densities is LDS. It iteratively computes the densest subgraph and removes it from the graph, where all the computed densest subgraphs form the candidates of LDSes. Then, each candidate is verified through a costly maximum flow computation. Although advanced pruning techniques are proposed in LDS, the verification step is still time consuming especially for not-so-small k values. In this paper, we aim to improve the efficiency of finding top-k LDSes by designing verification-free approaches. Our algorithms are based on our observation that the set of maximal λ-compact subgraphs for all possible λ values form a hierarchical structure, and LDSes are simply leaves in the hierarchical structure. Thus, we propose a divide-and-conquer algorithm LDS-DC as well as an optimized algorithm LDS-Opt to efficiently identify top-k LDSes without constructing the entire hierarchical structure. Both of our algorithms have lower time complexities than LDS. Extensive empirical studies on real graphs show that our optimized algorithm LDS-Opt outperforms LDS for all k values, and the improvement is up-to several orders of magnitude.</description><subject>Data engineering</subject><subject>Locally Dense Subgraphs</subject><subject>Locally Densest Subgraphs</subject><subject>Maximal λ-compact Subgraphs</subject><subject>Programming</subject><subject>Task analysis</subject><subject>Time complexity</subject><issn>2375-026X</issn><isbn>9798350322279</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2023</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><recordid>eNotjs1KAzEUhaMgWGrfoIu8wNR7kyaTuyz90cKAC4u4K5nxxkbqzJCMwry9FT2bs_gOH0eIOcICEeh-v95sjTFoFgqUXsAl7krMqCSnDWilVEnXYqJ0aQpQ9vVWzHL--J3REtHARFQvnGKIjR9i1xa7xCxXfZ8635w4y6GT23ChkdtBVl3jz-dRbrjNnAf5_FW_J9-f5CbmpvvmNN6Jm-DPmWf_PRWH3fawfiyqp4f9elUVkXAofG0b8uBqCkabZXiryTm-3AVDmgOgYqcbtKSc9Za8JmdLi3VJEDSooKdi_qeNzHzsU_z0aTwioFsaBP0DSJZOUQ</recordid><startdate>202304</startdate><enddate>202304</enddate><creator>Trung, Tran Ba</creator><creator>Chang, Lijun</creator><creator>Long, Nguyen Tien</creator><creator>Yao, Kai</creator><creator>Thanh Binh, Huynh Thi</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>202304</creationdate><title>Verification-Free Approaches to Efficient Locally Densest Subgraph Discovery</title><author>Trung, Tran Ba ; Chang, Lijun ; Long, Nguyen Tien ; Yao, Kai ; Thanh Binh, Huynh Thi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i91t-ab6c9a08b9f5354fdb988e5030593ef012e83c169286a69a3986761b790f302f3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Data engineering</topic><topic>Locally Dense Subgraphs</topic><topic>Locally Densest Subgraphs</topic><topic>Maximal λ-compact Subgraphs</topic><topic>Programming</topic><topic>Task analysis</topic><topic>Time complexity</topic><toplevel>online_resources</toplevel><creatorcontrib>Trung, Tran Ba</creatorcontrib><creatorcontrib>Chang, Lijun</creatorcontrib><creatorcontrib>Long, Nguyen Tien</creatorcontrib><creatorcontrib>Yao, Kai</creatorcontrib><creatorcontrib>Thanh Binh, Huynh Thi</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE/IET Electronic Library</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Trung, Tran Ba</au><au>Chang, Lijun</au><au>Long, Nguyen Tien</au><au>Yao, Kai</au><au>Thanh Binh, Huynh Thi</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Verification-Free Approaches to Efficient Locally Densest Subgraph Discovery</atitle><btitle>2023 IEEE 39th International Conference on Data Engineering (ICDE)</btitle><stitle>ICDE</stitle><date>2023-04</date><risdate>2023</risdate><spage>1</spage><epage>13</epage><pages>1-13</pages><eissn>2375-026X</eissn><eisbn>9798350322279</eisbn><coden>IEEPAD</coden><abstract>Finding dense subgraphs from a large graph is a fundamental graph mining task with many applications. The notion is recently formulated of locally densest subgraph (LDS) is recently formulated to identify multiple dense subgraphs that cover different regions of a large graph. Informally, an LDS is a subgraph with the highest density in its local region. The state-of-the-art algorithm for computing top-k LDSes with the highest densities is LDS. It iteratively computes the densest subgraph and removes it from the graph, where all the computed densest subgraphs form the candidates of LDSes. Then, each candidate is verified through a costly maximum flow computation. Although advanced pruning techniques are proposed in LDS, the verification step is still time consuming especially for not-so-small k values. In this paper, we aim to improve the efficiency of finding top-k LDSes by designing verification-free approaches. Our algorithms are based on our observation that the set of maximal λ-compact subgraphs for all possible λ values form a hierarchical structure, and LDSes are simply leaves in the hierarchical structure. Thus, we propose a divide-and-conquer algorithm LDS-DC as well as an optimized algorithm LDS-Opt to efficiently identify top-k LDSes without constructing the entire hierarchical structure. Both of our algorithms have lower time complexities than LDS. Extensive empirical studies on real graphs show that our optimized algorithm LDS-Opt outperforms LDS for all k values, and the improvement is up-to several orders of magnitude.</abstract><pub>IEEE</pub><doi>10.1109/ICDE55515.2023.00008</doi><tpages>13</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier EISSN: 2375-026X
ispartof 2023 IEEE 39th International Conference on Data Engineering (ICDE), 2023, p.1-13
issn 2375-026X
language eng
recordid cdi_ieee_primary_10184510
source IEEE Xplore All Conference Series
subjects Data engineering
Locally Dense Subgraphs
Locally Densest Subgraphs
Maximal λ-compact Subgraphs
Programming
Task analysis
Time complexity
title Verification-Free Approaches to Efficient Locally Densest Subgraph Discovery
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T18%3A33%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_CHZPO&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Verification-Free%20Approaches%20to%20Efficient%20Locally%20Densest%20Subgraph%20Discovery&rft.btitle=2023%20IEEE%2039th%20International%20Conference%20on%20Data%20Engineering%20(ICDE)&rft.au=Trung,%20Tran%20Ba&rft.date=2023-04&rft.spage=1&rft.epage=13&rft.pages=1-13&rft.eissn=2375-026X&rft.coden=IEEPAD&rft_id=info:doi/10.1109/ICDE55515.2023.00008&rft.eisbn=9798350322279&rft_dat=%3Cieee_CHZPO%3E10184510%3C/ieee_CHZPO%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-i91t-ab6c9a08b9f5354fdb988e5030593ef012e83c169286a69a3986761b790f302f3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10184510&rfr_iscdi=true