Data Sampling Strategies in Stochastic Algorithms for Empirical Risk Minimization

Dominik Csiba

arXiv:1804.00437 · math.OC · April 3, 2018 · 1 cites

TL;DR

This paper develops and analyzes advanced data sampling strategies for stochastic gradient descent methods in big data optimization, introducing a flexible framework that broadens applicability and improves efficiency.

Contribution

It introduces new state-of-the-art sampling strategies for convex problems and a generalized framework applicable to diverse problems and sampling rules.

Findings

01 New sampling strategies outperform existing methods.

02 A flexible framework broadens the applicability of stochastic algorithms.

03 Enhanced efficiency in large-scale convex optimization.

Abstract

Gradient descent methods, and especially their stochastic variants, have become highly popular in the last decade due to their efficiency on big data optimization problems. In this thesis we present the development of data sampling strategies for these methods. In the first four chapters we focus on four views of sampling for convex problems, developing and analyzing new state-of-the-art methods that use non-standard data sampling strategies. Finally, in the last chapter we present a more flexible framework, which generalizes to more problems as well as more sampling rules.
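
To give a rough sense of what a non-standard data sampling strategy can look like, the sketch below runs stochastic gradient descent on a small least-squares problem and picks example i with probability p_i instead of uniformly. It is a generic illustration, not a method taken from the thesis: the function names, the choice p_i proportional to the squared norm of a_i, the step size, and the toy data are all assumptions made for this example. The 1/(n p_i) reweighting is the standard way to keep the stochastic gradient an unbiased estimate of the full gradient under non-uniform sampling.

```python
import random


def full_gradient(A, b, x):
    """Gradient of f(x) = (1/n) * sum_i 0.5 * (a_i . x - b_i)^2."""
    n, d = len(A), len(x)
    g = [0.0] * d
    for a_i, b_i in zip(A, b):
        r = sum(a * xj for a, xj in zip(a_i, x)) - b_i
        for j in range(d):
            g[j] += r * a_i[j] / n
    return g


def importance_probs(A):
    """Assumed sampling rule: p_i proportional to ||a_i||^2."""
    norms = [sum(a * a for a in a_i) for a_i in A]
    total = sum(norms)
    return [v / total for v in norms]


def sgd_importance(A, b, x0, probs, step, iters, seed=0):
    """SGD that samples example i with probability probs[i]."""
    rng = random.Random(seed)
    n = len(A)
    x = list(x0)
    indices = list(range(n))
    for _ in range(iters):
        i = rng.choices(indices, weights=probs, k=1)[0]
        r = sum(a * xj for a, xj in zip(A[i], x)) - b[i]
        # Dividing by n * p_i keeps the gradient estimate unbiased.
        scale = step * r / (n * probs[i])
        x = [xj - scale * a for xj, a in zip(x, A[i])]
    return x


# Toy consistent system: b = A @ [1, -1], so the minimizer is [1, -1].
A = [[1.0, 0.0], [0.0, 2.0], [1.0, 1.0]]
b = [1.0, -2.0, 0.0]
probs = importance_probs(A)
x_hat = sgd_importance(A, b, [0.0, 0.0], probs, step=0.3, iters=500)
```

Setting `probs` to `[1/3, 1/3, 1/3]` recovers ordinary uniform-sampling SGD, which makes it easy to compare the two rules on the same data.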

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Statistical Methods and Inference · Risk and Portfolio Optimization