Structure-Adaptive, Variance-Reduced, and Accelerated Stochastic Optimization
Junqi Tang, Francis Bach, Mohammad Golbabaee, Mike Davies

TL;DR
This paper introduces structure-adaptive, variance-reduced, and accelerated stochastic optimization algorithms tailored for regularized empirical risk minimization, demonstrating improved convergence rates and adaptability to low-dimensional solution structures like sparsity and low-rank.
Contribution
It establishes fast convergence of SAGA under convex constraints, proposes a two-stage accelerated coordinate descent method, and develops an adaptive variant that estimates problem structure dynamically.
Findings
The SAGA algorithm achieves linear convergence on non-strongly-convex objectives with constraints.
The two-stage APCG method attains global and local accelerated linear convergence.
The adaptive variant effectively estimates restricted strong convexity during optimization.
Abstract
In this work we explore the fundamental structure-adaptiveness of state of the art randomized first order algorithms on regularized empirical risk minimization tasks, where the solution has intrinsic low-dimensional structure (such as sparsity and low-rank). Such structure is often enforced by non-smooth regularization or constraints. We start by establishing the fast linear convergence rate of the SAGA algorithm on non-strongly-convex objectives with convex constraints, via an argument of cone-restricted strong convexity. Then for the composite minimization task with a coordinate-wise separable convex regularization term, we propose and analyse a two stage accelerated coordinate descend algorithm (Two-Stage APCG). We provide the convergence analysis showing that the proposed method has a global convergence in general and enjoys a local accelerated linear convergence rate with respect…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Face and Expression Recognition
