Statistical Estimation from Dependent Data
Yuval Dagan, Constantinos Daskalakis, Nishanth Dikkala, Surbhi Goel,, Anthimos Vardis Kandiros

TL;DR
This paper develops algorithms and theoretical guarantees for statistical estimation in dependent data settings modeled by Markov Random Fields, applicable to logistic regression, neural networks, and real network data.
Contribution
It introduces methods for efficient parameter estimation in dependent data models, especially Ising models, without assuming high-temperature conditions.
Findings
Algorithms achieve statistically efficient rates.
Outperforms standard regression on network datasets.
Validates approach on real text classification data.
Abstract
We consider a general statistical estimation problem wherein binary labels across different observations are not independent conditioned on their feature vectors, but dependent, capturing settings where e.g. these observations are collected on a spatial domain, a temporal domain, or a social network, which induce dependencies. We model these dependencies in the language of Markov Random Fields and, importantly, allow these dependencies to be substantial, i.e do not assume that the Markov Random Field capturing these dependencies is in high temperature. As our main contribution we provide algorithms and statistically efficient estimation rates for this model, giving several instantiations of our bounds in logistic regression, sparse logistic regression, and neural network settings with dependent data. Our estimation guarantees follow from novel results for estimating the parameters (i.e.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsComplex Network Analysis Techniques · Statistical Methods and Inference · Bayesian Modeling and Causal Inference
