The Stochastic Topic Block Model for the Clustering of Vertices in Networks with Textual Edges
Bouveyron Charles, Latouche Pierre, Zreik Rawya

TL;DR
This paper introduces the stochastic topic block model (STBM), a probabilistic approach for clustering network vertices based on both network structure and textual edge content, enhancing traditional network analysis methods.
Contribution
The paper presents the STBM model and a C-VEM algorithm for joint clustering of vertices considering network links and textual data, a novel integration in network analysis.
Findings
Effective in identifying meaningful clusters in simulated data
Successfully applied to real-world communication and co-authorship networks
Improves clustering accuracy by incorporating textual information
Abstract
Due to the significant increase of communications between individuals via social media (Facebook, Twitter, Linkedin) or electronic formats (email, web, e-publication) in the past two decades, network analysis has become a unavoidable discipline. Many random graph models have been proposed to extract information from networks based on person-to-person links only, without taking into account information on the contents. This paper introduces the stochastic topic block model (STBM), a probabilistic model for networks with textual edges. We address here the problem of discovering meaningful clusters of vertices that are coherent from both the network interactions and the text contents. A classification variational expectation-maximization (C-VEM) algorithm is proposed to perform inference. Simulated data sets are considered in order to assess the proposed approach and to highlight its main…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
