Estimating latent feature-feature interactions in large feature-rich graphs
Corrado Monti, Paolo Boldi

TL;DR
This paper introduces scalable methods to estimate latent feature interactions in large, feature-rich graphs, improving understanding of how features influence link formation beyond simple homophily models.
Contribution
It proposes two novel algorithms, based on Naive Bayes and perceptrons, to efficiently estimate feature interactions in large networks, extending the MGJ model.
Findings
Naive Bayes approach is less effective due to independence assumptions.
Perceptron-based method is fast and scalable for large datasets.
Real-world citation network analysis reveals meaningful feature interactions.
Abstract
Real-world complex networks describe connections between objects; in reality, those objects are often endowed with some kind of features. How does the presence or absence of such features interplay with the network link structure? Although the situation here described is truly ubiquitous, there is a limited body of research dealing with large graphs of this kind. Many previous works considered homophily as the only possible transmission mechanism translating node features into links. Other authors, instead, developed more sophisticated models, that are able to handle complex feature interactions, but are unfit to scale to very large networks. We expand on the MGJ model, where interactions between pairs of features can foster or discourage link formation. In this work, we will investigate how to estimate the latent feature-feature interactions in this model. We shall propose two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
