Covariate-Assisted Community Detection on Sparse Networks
Yaofang Hu, Wanjie Wang

TL;DR
This paper introduces a new model and spectral clustering method that effectively detects communities in sparse networks by integrating covariate information, even when covariate labels are inconsistent with community labels.
Contribution
It proposes the DCSBM-NC model and the CA-SCORE algorithm, which together improve community detection in sparse networks using covariates, handling high-dimensionality and label inconsistency.
Findings
CA-SCORE accurately recovers communities in synthetic data
Method performs well on real datasets
Effective in networks with dense connections and matching covariate labels
Abstract
Community detection is an important problem when processing network data. Traditionally, this is done by exploiting the connections between nodes, but connections can be too sparse to detect communities in many real datasets. Node covariates can be used to assist community detection; see Binkiewicz et al. (2017); Weng and Feng (2022); Yan and Sarkar (2021); Yang et al. (2013). However, how to combine covariates with network connections is challenging, because covariates may be high-dimensional and inconsistent with community labels. To study the relationship between covariates and communities, we propose the degree corrected stochastic block model with node covariates (DCSBM-NC). It allows degree heterogeneity among communities and inconsistent labels between communities and covariates. Based on DCSBM-NC, we design the adjusted neighbor-covariate (ANC) data matrix, which leverages…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Clustering Algorithms Research · Human Mobility and Location-Based Analysis
