Sparse network asymptotics for logistic regression
Bryan S. Graham

TL;DR
This paper develops asymptotic theory for logistic regression on large, sparse bipartite networks, showing that standard inference methods need adjustment to account for network sparsity and dependence.
Contribution
It introduces a novel sparse network asymptotic framework for logistic regression, extending existing results to settings with sparse, large networks and dyadic dependence.
Findings
Asymptotic normality of logistic coefficients under sparsity
Variance decomposition reveals different behavior in sparse vs dense networks
Proposes variance estimators accounting for network dependence
Abstract
Consider a bipartite network where consumers choose to buy or not to buy different products. This paper considers the properties of the logistic regression of the array of i-buys-j purchase decisions, , onto known functions of consumer and product attributes under asymptotic sequences where (i) both and grow large and (ii) the average number of products purchased per consumer is finite in the limit. This latter assumption implies that the network of purchases is sparse: only a (very) small fraction of all possible purchases are actually made (concordant with many real-world settings). Under sparse network asymptotics, the first and last terms in an extended Hoeffding-type variance decomposition of the score of the logit composite log-likelihood are of equal order. In contrast, under dense network asymptotics,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Mental Health Research Topics · Statistical Methods and Inference
MethodsLogistic Regression
