Network homophily via tail inequalities
Nicola Apollonio, Paolo G. Franciosa, Daniele Santoni

TL;DR
This paper introduces a new, computationally feasible method to quantify network homophily using tail inequalities, providing reliable indices that outperform existing measures in real-world network analysis.
Contribution
The paper develops a novel class of homophily indices based on tail inequalities, addressing the NP-hardness of approximation and incorporating covariance matrix computation within the random coloring model.
Findings
New homophily indices are effective in real-world networks.
The indices are computationally feasible and reliable.
They enable discoveries beyond current state-of-the-art methods.
Abstract
Homophily is the principle whereby "similarity breeds connections". We give a quantitative formulation of this principle within networks. Given a network and a labeled partition of its vertices, the vector indexed by each class of the partition, whose entries are the number of edges of the subgraphs induced by the corresponding classes, is viewed as the observed outcome of the random vector described by picking labeled partitions at random among labeled partitions whose classes have the same cardinalities as the given one. In this perspective, the value of any homophily score , namely a non decreasing real valued function in the sizes of subgraphs induced by the classes of the partition, evaluated at the observed outcome, can be thought of as the observed value of a random variable. Consequently, according to the score , the input network is homophillic at the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Topological and Geometric Data Analysis · Opinion Dynamics and Social Influence
