Stochastic dual coordinate descent with adaptive heavy ball momentum for linearly constrained convex optimization
Yun Zeng, Deren Han, Yansheng Su, Jiaxin Xie

TL;DR
This paper introduces an adaptive stochastic dual coordinate descent algorithm with heavy ball momentum for linearly constrained convex optimization, improving scalability and convergence in large-scale problems.
Contribution
It develops a novel adaptive momentum method that learns parameters during iterations, unifying and extending several existing algorithms for constrained convex optimization.
Findings
The method converges linearly in expectation for strongly convex functions.
Numerical experiments confirm the theoretical convergence and effectiveness.
The framework encompasses well-known algorithms like randomized Kaczmarz and conjugate gradient.
Abstract
The problem of finding a solution to the linear system with certain minimization properties arises in numerous scientific and engineering areas. In the era of big data, the stochastic optimization algorithms become increasingly significant due to their scalability for problems of unprecedented size. This paper focuses on the problem of minimizing a strongly convex function subject to linear constraints. We consider the dual formulation of this problem and adopt the stochastic coordinate descent to solve it. The proposed algorithmic framework, called adaptive stochastic dual coordinate descent, utilizes sampling matrices sampled from user-defined distributions to extract gradient information. Moreover, it employs Polyak's heavy ball momentum acceleration with adaptive parameters learned through iterations, overcoming the limitation of the heavy ball momentum method that it…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Random lasers and scattering media
