A distributed overlapping community detection model for large graphs using autoencoder

Bibliographic Details
Published in: Future generation computer systems 2019-05, Vol.94, p.16-26
Main Authors: Bhatia, Vandana, Rani, Rinkle
Format: Article
Language: English
Description
Summary: Community detection has become pervasive in finding similar patterns present in a network. It aims to discover a lower-dimensional embedding that represents the structure of the network. Many real-life networks comprise overlapping communities and have non-linear features. Despite their great potential for analyzing network structure, the existing approaches provide limited support and find disjoint communities only. As data is growing at an unprecedented rate, scalable and intelligent solutions are needed for identifying similar patterns. Motivated by the robust representation ability of deep neural network based autoencoders, we propose a learning model named ‘DeCom’ for finding overlapping communities in large networks. DeCom uses an autoencoder-based layered approach to initialize candidate seed nodes and to determine the number of communities by considering the network structure. The selected seed nodes and formed clusters are refined in the last layer by minimizing the reconstruction error using modularity. The performance of DeCom is compared with three state-of-the-art clustering algorithms on real-life networks. It is observed that the felicitous selection of seed nodes reduces the number of iterations. The experimental results reveal that the proposed DeCom scales up linearly to handle large graphs and produces better-quality clusters than the other state-of-the-art clustering algorithms.
• Proposed a parallel overlapping community detection model by leveraging autoencoder pipelines for large graphs.
• It finds the number of communities by analyzing the structure of the graph.
• DeCom scales up well to handle large graphs.
• DeCom outperforms the competing algorithms in terms of quality and processing time.
ISSN: 0167-739X
1872-7115
DOI: 10.1016/j.future.2018.10.045
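
The summary mentions modularity as the quantity used when refining clusters. As a rough illustration only, the sketch below computes standard Newman modularity for a disjoint partition of an undirected, unweighted graph. It is not DeCom's implementation: the record does not give the paper's overlapping-community formulation or its autoencoder layers, and the function name `modularity`, the edge-list input, and the `community_of` mapping are assumptions made for this example.

```python
from collections import defaultdict


def modularity(edges, community_of):
    """Newman modularity Q for a disjoint partition (illustrative sketch).

    edges: list of (u, v) pairs of an undirected, unweighted graph.
    community_of: dict mapping each node to a community label (hypothetical input).
    """
    m = len(edges)
    if m == 0:
        raise ValueError("graph has no edges")

    degree = defaultdict(int)    # degree of each node
    internal = defaultdict(int)  # number of edges inside each community
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
        if community_of[u] == community_of[v]:
            internal[community_of[u]] += 1

    degree_sum = defaultdict(int)  # total degree of each community
    for node, d in degree.items():
        degree_sum[community_of[node]] += d

    # Q = sum over communities of (internal edge fraction - expected fraction)
    return sum(
        internal[c] / m - (degree_sum[c] / (2 * m)) ** 2
        for c in degree_sum
    )


# Two triangles joined by a single bridge edge.
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
two_groups = {0: "a", 1: "a", 2: "a", 3: "b", 4: "b", 5: "b"}
one_group = {n: "a" for n in range(6)}

q_two = modularity(edges, two_groups)
q_one = modularity(edges, one_group)
```

On this toy graph, splitting the two triangles gives a modularity of 5/14 (about 0.357), while placing every node in one community gives 0, which is why a higher value is read as a better-separated clustering.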