Learning Social Networks from Text Data using Covariate Information
Xiaoyi Yang, Nynke M.D. Niezink, Rebecca Nugent

TL;DR
This paper introduces an extended Local Poisson Graphical Lasso model that incorporates covariate information to improve the accuracy of social network extraction from historical biographical texts.
Contribution
It extends the existing model with a penalty structure using covariates and proposes greedy and Bayesian estimation methods, enhancing network recovery from text data.
Findings
Improved network recovery in simulated data with historical characteristics
Enhanced precision and recall in identifying true social links
Successful application to early modern British biographical data
Abstract
Describing and characterizing the impact of historical figures can be challenging, but unraveling their social structures perhaps even more so. Historical social network analysis methods can help and may also illuminate people who have been overlooked by historians but turn out to be influential social connection points. Text data, such as biographies, can be a useful source of information about the structure of historical social networks but can also introduce challenges in identifying links. The Local Poisson Graphical Lasso model leverages the number of co-mentions in the text to measure relationships between people and uses a conditional independence structure to model a social network. This structure will reduce the tendency to overstate the relationship between "friends of friends", but given the historical high frequency of common names, without additional distinguishing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
