Accelerated Mini-Batch Stochastic Dual Coordinate Ascent

Shai Shalev-Shwartz; Tong Zhang

arXiv:1305.2581 · stat.ML · May 14, 2013 · 61 cites

Open Access

TL;DR

This paper introduces an accelerated mini-batch variant of stochastic dual coordinate ascent (SDCA) for regularized loss minimization problems in machine learning, proves a fast convergence rate for it, and discusses an implementation on a parallel computing system.

Contribution

It presents an accelerated mini-batch SDCA algorithm with a proven fast convergence rate, extending SDCA to the mini-batch setting, where the updates within a batch can be computed in parallel.

Findings

01. Proves a fast convergence rate for the proposed method.

02. Demonstrates improved performance over vanilla SDCA.

03. Shows competitiveness with accelerated gradient descent methods.

Abstract

Stochastic dual coordinate ascent (SDCA) is an effective technique for solving regularized loss minimization problems in machine learning. This paper considers an extension of SDCA under the mini-batch setting that is often used in practice. Our main contribution is to introduce an accelerated mini-batch version of SDCA and prove a fast convergence rate for this method. We discuss an implementation of our method over a parallel computing system, and compare the results to both the vanilla stochastic dual coordinate ascent and to the accelerated deterministic gradient descent method of Nesterov (2007).
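
To make the mini-batch setting concrete, the following is a minimal sketch of plain (non-accelerated) mini-batch SDCA for ridge regression. It is not the accelerated algorithm proposed in the paper. The squared loss, the conservative 1/b averaging of the per-coordinate updates, the synthetic data, and all names (`minibatch_sdca`, `primal`, `dual`, `batch_size`) are assumptions chosen for illustration.

```python
import numpy as np

def primal(w, X, y, lam):
    # P(w) = (1/n) * sum_i 0.5 * (x_i^T w - y_i)^2 + (lam/2) * ||w||^2
    return 0.5 * np.mean((X @ w - y) ** 2) + 0.5 * lam * (w @ w)

def dual(alpha, X, y, lam):
    # D(alpha) = (1/n) * sum_i (alpha_i * y_i - 0.5 * alpha_i^2) - (lam/2) * ||w(alpha)||^2
    n = X.shape[0]
    w = X.T @ alpha / (lam * n)
    return np.mean(alpha * y - 0.5 * alpha ** 2) - 0.5 * lam * (w @ w)

def minibatch_sdca(X, y, lam, batch_size, n_iters, seed=0):
    """Plain mini-batch SDCA for squared loss (illustrative, not accelerated)."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    alpha = np.zeros(n)              # dual variables, one per example
    w = np.zeros(d)                  # primal iterate w(alpha) = X^T alpha / (lam * n)
    sq_norms = np.sum(X ** 2, axis=1)
    for _ in range(n_iters):
        idx = rng.choice(n, size=batch_size, replace=False)
        # Closed-form single-coordinate maximizer for squared loss,
        # computed for every example in the batch from the same w.
        residual = y[idx] - X[idx] @ w - alpha[idx]
        delta = residual / (1.0 + sq_norms[idx] / (lam * n))
        # Conservative step: average the batch updates (scale by 1/b).
        # By concavity of the dual, this cannot decrease D(alpha).
        delta /= batch_size
        alpha[idx] += delta
        w += X[idx].T @ delta / (lam * n)
    return w, alpha

# Synthetic problem (assumed data, rows normalized to unit norm).
rng = np.random.default_rng(0)
n, d, lam = 200, 5, 0.1
X = rng.normal(size=(n, d))
X /= np.linalg.norm(X, axis=1, keepdims=True)
y = X @ rng.normal(size=d) + 0.1 * rng.normal(size=n)

w, alpha = minibatch_sdca(X, y, lam, batch_size=10, n_iters=6000)
gap = primal(w, X, y, lam) - dual(alpha, X, y, lam)

# Closed-form ridge solution for comparison.
w_star = np.linalg.solve(X.T @ X + lam * n * np.eye(d), X.T @ y)
print("duality gap:", gap)
```

The per-example updates inside a batch depend only on the shared `w`, which is what makes the mini-batch step amenable to parallel computation. The 1/b averaging used here is a deliberately cautious choice; it trades speed for a guaranteed dual ascent and should not be read as the step rule analyzed in the paper.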

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Markov Chains and Monte Carlo Methods