Loading…

Vertex nomination: The canonical sampling and the extended spectral nomination schemes

Suppose that one particular block in a stochastic block model is of interest, but block labels are only observed for a few of the vertices in the network. Utilizing a graph realized from the model and the observed block labels, the vertex nomination task is to order the vertices with unobserved bloc...

Full description

Saved in:
Bibliographic Details
Published in:Computational statistics & data analysis 2020-05, Vol.145, p.106916, Article 106916
Main Authors: Yoder, Jordan, Chen, Li, Pao, Henry, Bridgeford, Eric, Levin, Keith, Fishkind, Donniell E., Priebe, Carey, Lyzinski, Vince
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Suppose that one particular block in a stochastic block model is of interest, but block labels are only observed for a few of the vertices in the network. Utilizing a graph realized from the model and the observed block labels, the vertex nomination task is to order the vertices with unobserved block labels into a ranked nomination list with the goal of having an abundance of interesting vertices near the top of the list. There are vertex nomination schemes in the literature, including the optimally precise canonical nomination scheme LC and the consistent spectral partitioning nomination scheme LP. While the canonical nomination scheme LC is provably optimally precise, it is computationally intractable, being impractical to implement even on modestly sized graphs. With this in mind, an approximation of the canonical scheme – denoted the canonical sampling nomination schemeLCS – is introduced; LCS relies on a scalable, Markov chain Monte Carlo-based approximation of LC, and converges to LC as the amount of sampling goes to infinity. The spectral partitioning nomination scheme is also extended to the extended spectral partitioning nomination scheme, LEP, which introduces a novel semisupervised clustering framework to improve upon the precision of LP. Real-data and simulation experiments are employed to illustrate the precision of these vertex nomination schemes, as well as their empirical computational complexity.
ISSN:0167-9473
1872-7352
DOI:10.1016/j.csda.2020.106916