Subsampled Optimization: Statistical Guarantees, Mean Squared Error Approximation, and Sampling Method
Rong Zhu, Jiming Jiang

TL;DR
This paper develops theoretical guarantees and practical sampling methods for subsampled optimization, enabling efficient approximation of large-scale data solutions with controlled error and confidence.
Contribution
It provides asymptotic properties, MSE approximation, and an optimal Hessian-based sampling method for subsampled optimization.
Findings
Asymptotic properties of approximate solutions are established.
An accurate MSE approximation and unbiased estimator are derived.
Hessian-based sampling improves efficiency in numerical experiments.
Abstract
For optimization on large-scale data, exactly calculating its solution may be computationally difficulty because of the large size of the data. In this paper we consider subsampled optimization for fast approximating the exact solution. In this approach, one gets a surrogate dataset by sampling from the full data, and then obtains an approximate solution by solving the subsampled optimization based on the surrogate. One main theoretical contributions are to provide the asymptotic properties of the approximate solution with respect to the exact solution as statistical guarantees, and to rigorously derive an accurate approximation of the mean squared error (MSE) and an approximately unbiased MSE estimator. These results help us better diagnose the subsampled optimization in the context that a confidence region on the exact solution is provided using the approximate solution. The other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research · Blind Source Separation Techniques
MethodsLogistic Regression
