Data Sampling Strategies in Stochastic Algorithms for Empirical Risk Minimization
Dominik Csiba

TL;DR
This paper develops and analyzes advanced data sampling strategies for stochastic gradient descent methods in big data optimization, introducing a flexible framework that broadens applicability and improves efficiency.
Contribution
It introduces new state-of-the-art sampling strategies for convex problems and a generalized framework applicable to diverse problems and sampling rules.
Findings
New sampling strategies outperform existing methods.
A flexible framework broadens the applicability of stochastic algorithms.
Enhanced efficiency in large-scale convex optimization.
Abstract
Gradient descent methods and especially their stochastic variants have become highly popular in the last decade due to their efficiency on big data optimization problems. In this thesis we present the development of data sampling strategies for these methods. In the first four chapters we focus on four views on the sampling for convex problems, developing and analyzing new state-of-the-art methods using non-standard data sampling strategies. Finally, in the last chapter we present a more flexible framework, which generalizes to more problems as well as more sampling rules.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Statistical Methods and Inference · Risk and Portfolio Optimization
