Loading…

Detecting Overlapping Communities from Local Spectral Subspaces

Based on the definition of local spectral subspace, we propose a novel approach called LOSP for local overlapping community detection. Instead of using the invariant subspace spanned by the dominant eigenvectors of the entire network, we run the power method for a few steps to approximate the leadin...

Full description

Saved in:
Bibliographic Details
Published in:arXiv.org 2015-09
Main Authors: He, Kun, Sun, Yiwei, Bindel, David, Hopcroft, John E, Li, Yixuan
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by
cites
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator He, Kun
Sun, Yiwei
Bindel, David
Hopcroft, John E
Li, Yixuan
description Based on the definition of local spectral subspace, we propose a novel approach called LOSP for local overlapping community detection. Instead of using the invariant subspace spanned by the dominant eigenvectors of the entire network, we run the power method for a few steps to approximate the leading eigenvectors that depict the embedding of the local neighborhood structure around seeds of interest. We then seek a sparse approximate indicator vector in the local spectral subspace spanned by these vectors such that the seeds are in its support. We evaluate LOSP on five large real world networks across various domains with labeled ground-truth communities and compare the results with the state-of-the-art community detection approaches. LOSP identifies the members of a target community with high accuracy from very few seed members, and outperforms the local Heat Kernel or PageRank diffusions as well as the global baselines. Two candidate definitions of the local spectral subspace are analyzed, and different community scoring functions for determining the community boundary, including two new metrics, are thoroughly evaluated. The structural properties of different seed sets and the impact of the seed set size are discussed. We observe low degree seeds behave better, and LOSP is robust even when started from a single random seed. Using LOSP as a subroutine and starting from each ego connected component, we try the harder yet significant task of identifying all communities a single vertex is in. Experiments show that the proposed method achieves high F1 measures on the detected multiple local overlapping communities containing the seed vertex.
format article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2083137187</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2083137187</sourcerecordid><originalsourceid>FETCH-proquest_journals_20831371873</originalsourceid><addsrcrecordid>eNpjYuA0MjY21LUwMTLiYOAtLs4yMDAwMjM3MjU15mSwd0ktSU0uycxLV_AvSy3KSSwoALGd83NzS_MySzJTixXSivJzFXzykxNzFIILgGqLQIzSpOKCxOTUYh4G1rTEnOJUXijNzaDs5hri7KFbUJRfWJpaXBKflV9alAeUijcysDA2NDY3tDA3Jk4VACccOQU</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2083137187</pqid></control><display><type>article</type><title>Detecting Overlapping Communities from Local Spectral Subspaces</title><source>Publicly Available Content (ProQuest)</source><creator>He, Kun ; Sun, Yiwei ; Bindel, David ; Hopcroft, John E ; Li, Yixuan</creator><creatorcontrib>He, Kun ; Sun, Yiwei ; Bindel, David ; Hopcroft, John E ; Li, Yixuan</creatorcontrib><description>Based on the definition of local spectral subspace, we propose a novel approach called LOSP for local overlapping community detection. Instead of using the invariant subspace spanned by the dominant eigenvectors of the entire network, we run the power method for a few steps to approximate the leading eigenvectors that depict the embedding of the local neighborhood structure around seeds of interest. We then seek a sparse approximate indicator vector in the local spectral subspace spanned by these vectors such that the seeds are in its support. We evaluate LOSP on five large real world networks across various domains with labeled ground-truth communities and compare the results with the state-of-the-art community detection approaches. LOSP identifies the members of a target community with high accuracy from very few seed members, and outperforms the local Heat Kernel or PageRank diffusions as well as the global baselines. Two candidate definitions of the local spectral subspace are analyzed, and different community scoring functions for determining the community boundary, including two new metrics, are thoroughly evaluated. The structural properties of different seed sets and the impact of the seed set size are discussed. We observe low degree seeds behave better, and LOSP is robust even when started from a single random seed. Using LOSP as a subroutine and starting from each ego connected component, we try the harder yet significant task of identifying all communities a single vertex is in. Experiments show that the proposed method achieves high F1 measures on the detected multiple local overlapping communities containing the seed vertex.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Communities ; Domains ; Eigenvectors ; Search engines ; Seeds ; Spectra ; Subspaces</subject><ispartof>arXiv.org, 2015-09</ispartof><rights>2015. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2083137187?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>776,780,25731,36989,44566</link.rule.ids></links><search><creatorcontrib>He, Kun</creatorcontrib><creatorcontrib>Sun, Yiwei</creatorcontrib><creatorcontrib>Bindel, David</creatorcontrib><creatorcontrib>Hopcroft, John E</creatorcontrib><creatorcontrib>Li, Yixuan</creatorcontrib><title>Detecting Overlapping Communities from Local Spectral Subspaces</title><title>arXiv.org</title><description>Based on the definition of local spectral subspace, we propose a novel approach called LOSP for local overlapping community detection. Instead of using the invariant subspace spanned by the dominant eigenvectors of the entire network, we run the power method for a few steps to approximate the leading eigenvectors that depict the embedding of the local neighborhood structure around seeds of interest. We then seek a sparse approximate indicator vector in the local spectral subspace spanned by these vectors such that the seeds are in its support. We evaluate LOSP on five large real world networks across various domains with labeled ground-truth communities and compare the results with the state-of-the-art community detection approaches. LOSP identifies the members of a target community with high accuracy from very few seed members, and outperforms the local Heat Kernel or PageRank diffusions as well as the global baselines. Two candidate definitions of the local spectral subspace are analyzed, and different community scoring functions for determining the community boundary, including two new metrics, are thoroughly evaluated. The structural properties of different seed sets and the impact of the seed set size are discussed. We observe low degree seeds behave better, and LOSP is robust even when started from a single random seed. Using LOSP as a subroutine and starting from each ego connected component, we try the harder yet significant task of identifying all communities a single vertex is in. Experiments show that the proposed method achieves high F1 measures on the detected multiple local overlapping communities containing the seed vertex.</description><subject>Communities</subject><subject>Domains</subject><subject>Eigenvectors</subject><subject>Search engines</subject><subject>Seeds</subject><subject>Spectra</subject><subject>Subspaces</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><sourceid>PIMPY</sourceid><recordid>eNpjYuA0MjY21LUwMTLiYOAtLs4yMDAwMjM3MjU15mSwd0ktSU0uycxLV_AvSy3KSSwoALGd83NzS_MySzJTixXSivJzFXzykxNzFIILgGqLQIzSpOKCxOTUYh4G1rTEnOJUXijNzaDs5hri7KFbUJRfWJpaXBKflV9alAeUijcysDA2NDY3tDA3Jk4VACccOQU</recordid><startdate>20150927</startdate><enddate>20150927</enddate><creator>He, Kun</creator><creator>Sun, Yiwei</creator><creator>Bindel, David</creator><creator>Hopcroft, John E</creator><creator>Li, Yixuan</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20150927</creationdate><title>Detecting Overlapping Communities from Local Spectral Subspaces</title><author>He, Kun ; Sun, Yiwei ; Bindel, David ; Hopcroft, John E ; Li, Yixuan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_20831371873</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Communities</topic><topic>Domains</topic><topic>Eigenvectors</topic><topic>Search engines</topic><topic>Seeds</topic><topic>Spectra</topic><topic>Subspaces</topic><toplevel>online_resources</toplevel><creatorcontrib>He, Kun</creatorcontrib><creatorcontrib>Sun, Yiwei</creatorcontrib><creatorcontrib>Bindel, David</creatorcontrib><creatorcontrib>Hopcroft, John E</creatorcontrib><creatorcontrib>Li, Yixuan</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content (ProQuest)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>He, Kun</au><au>Sun, Yiwei</au><au>Bindel, David</au><au>Hopcroft, John E</au><au>Li, Yixuan</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Detecting Overlapping Communities from Local Spectral Subspaces</atitle><jtitle>arXiv.org</jtitle><date>2015-09-27</date><risdate>2015</risdate><eissn>2331-8422</eissn><abstract>Based on the definition of local spectral subspace, we propose a novel approach called LOSP for local overlapping community detection. Instead of using the invariant subspace spanned by the dominant eigenvectors of the entire network, we run the power method for a few steps to approximate the leading eigenvectors that depict the embedding of the local neighborhood structure around seeds of interest. We then seek a sparse approximate indicator vector in the local spectral subspace spanned by these vectors such that the seeds are in its support. We evaluate LOSP on five large real world networks across various domains with labeled ground-truth communities and compare the results with the state-of-the-art community detection approaches. LOSP identifies the members of a target community with high accuracy from very few seed members, and outperforms the local Heat Kernel or PageRank diffusions as well as the global baselines. Two candidate definitions of the local spectral subspace are analyzed, and different community scoring functions for determining the community boundary, including two new metrics, are thoroughly evaluated. The structural properties of different seed sets and the impact of the seed set size are discussed. We observe low degree seeds behave better, and LOSP is robust even when started from a single random seed. Using LOSP as a subroutine and starting from each ego connected component, we try the harder yet significant task of identifying all communities a single vertex is in. Experiments show that the proposed method achieves high F1 measures on the detected multiple local overlapping communities containing the seed vertex.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2015-09
issn 2331-8422
language eng
recordid cdi_proquest_journals_2083137187
source Publicly Available Content (ProQuest)
subjects Communities
Domains
Eigenvectors
Search engines
Seeds
Spectra
Subspaces
title Detecting Overlapping Communities from Local Spectral Subspaces
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-31T23%3A14%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Detecting%20Overlapping%20Communities%20from%20Local%20Spectral%20Subspaces&rft.jtitle=arXiv.org&rft.au=He,%20Kun&rft.date=2015-09-27&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2083137187%3C/proquest%3E%3Cgrp_id%3Ecdi_FETCH-proquest_journals_20831371873%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=2083137187&rft_id=info:pmid/&rfr_iscdi=true