Structure-Adaptive, Variance-Reduced, and Accelerated Stochastic   Optimization

Junqi Tang; Francis Bach; Mohammad Golbabaee; Mike Davies

arXiv:1712.03156·math.OC·December 13, 2017

Structure-Adaptive, Variance-Reduced, and Accelerated Stochastic Optimization

Junqi Tang, Francis Bach, Mohammad Golbabaee, Mike Davies

PDF

Open Access

TL;DR

This paper introduces structure-adaptive, variance-reduced, and accelerated stochastic optimization algorithms tailored for regularized empirical risk minimization, demonstrating improved convergence rates and adaptability to low-dimensional solution structures like sparsity and low-rank.

Contribution

It establishes fast convergence of SAGA under convex constraints, proposes a two-stage accelerated coordinate descent method, and develops an adaptive variant that estimates problem structure dynamically.

Findings

01

The SAGA algorithm achieves linear convergence on non-strongly-convex objectives with constraints.

02

The two-stage APCG method attains global and local accelerated linear convergence.

03

The adaptive variant effectively estimates restricted strong convexity during optimization.

Abstract

In this work we explore the fundamental structure-adaptiveness of state of the art randomized first order algorithms on regularized empirical risk minimization tasks, where the solution has intrinsic low-dimensional structure (such as sparsity and low-rank). Such structure is often enforced by non-smooth regularization or constraints. We start by establishing the fast linear convergence rate of the SAGA algorithm on non-strongly-convex objectives with convex constraints, via an argument of cone-restricted strong convexity. Then for the composite minimization task with a coordinate-wise separable convex regularization term, we propose and analyse a two stage accelerated coordinate descend algorithm (Two-Stage APCG). We provide the convergence analysis showing that the proposed method has a global convergence in general and enjoys a local accelerated linear convergence rate with respect…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Face and Expression Recognition