Confidence Intervals for Algorithmic Leveraging in Linear Regression
Katelyn Gao

TL;DR
This paper develops efficient algorithms to construct finite sample confidence intervals for regression coefficients estimated via algorithmic leveraging, providing asymptotic coverage guarantees and empirical validation.
Contribution
It introduces novel algorithms for confidence interval construction in large-scale linear regression using algorithmic leveraging, with proven coverage guarantees.
Findings
Confidence intervals achieve desired coverage probabilities in simulations
Bootstrap intervals may not maintain coverage
Algorithms are computationally efficient for big data
Abstract
The age of big data has produced data sets that are computationally expensive to analyze and store. Algorithmic leveraging proposes that we sample observations from the original data set to generate a representative data set and then perform analysis on the representative data set. In this paper, we present efficient algorithms for constructing finite sample confidence intervals for each algorithmic leveraging estimated regression coefficient, with asymptotic coverage guarantees. In simulations, we confirm empirically that the confidence intervals have the desired coverage probabilities, while bootstrap confidence intervals may not.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Distributed Sensor Networks and Detection Algorithms · Gaussian Processes and Bayesian Inference
