A Framework for Statistical Inference via Randomized Algorithms
Zhixiang Zhang, Sokbae Lee, Edgar Dobriban

TL;DR
This paper introduces a statistical inference framework for randomized algorithms, enabling uncertainty quantification of their outputs, with practical methods demonstrated on stochastic optimization and randomized sketching.
Contribution
It develops novel inference methods that estimate the distribution of randomized algorithm outputs, allowing accurate uncertainty quantification in large-scale data analysis.
Findings
Inference methods work with unknown limiting distributions.
Methods are effective for stochastic optimization and sketching.
Overhead of inference methods is often negligible.
Abstract
Randomized algorithms, such as randomized sketching or stochastic optimization, are a promising approach to ease the computational burden in analyzing large datasets. However, randomized algorithms also produce non-deterministic outputs, leading to the problem of evaluating their accuracy. In this paper, we develop a statistical inference framework for quantifying the uncertainty of the outputs of randomized algorithms. Our key conclusion is that one can perform statistical inference for the target of a sequence of randomized algorithms as long as in the limit, their outputs fluctuate around the target according to any (possibly unknown) probability distribution. In this setting, we develop appropriate statistical inference methods -- sub-randomization, multi-run plug-in and multi-run aggregation -- by estimating the unknown parameters of the limiting distribution either using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Statistical Methods and Inference · Face and Expression Recognition
