Understanding friendship formation with explainable machine learning
Mar\'ia Pereda

TL;DR
This study uses explainable machine learning to analyze social tie formation, revealing that local network structure mainly drives relationships, with individual traits explaining only a small subset of links.
Contribution
It demonstrates the effectiveness of interpretable ML models in uncovering the dominant role of network structure over individual traits in social tie formation.
Findings
Triadic influence dominates link prediction.
A small subset (0.24%) of links is explained by individual traits.
Links explained by traits tend to be weaker, less embedded, and more negative.
Abstract
Understanding the formation of social ties requires disentangling the roles of individual traits and local network structure. We analyse signed social relationships among 3,395 students using an interpretable machine learning model -- the Explainable Boosting Machine (EBM) -- to predict link polarity from individual attributes (prosociality, cognitive reflection, and gender) and a structural metric, triadic influence. Our results show that triadic influence overwhelmingly dominates link prediction, confirming that local network structure is the primary driver of social relationships. Nevertheless, a small subset of links (0.24\%) is primarily explained by individual-level traits. A detailed characterisation of this subset reveals that these links do not arise from distinct structural conditions, but rather correspond to weaker and less structurally embedded relationships. In particular,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
