Performance Limits of Stochastic Sub-Gradient Learning, Part I: Single Agent Case
Bicheng Ying, Ali H. Sayed

TL;DR
This paper analyzes the performance limits of stochastic sub-gradient methods in single-agent learning, showing they can achieve linear convergence under weaker conditions than traditionally assumed, with practical implications for problems like SVM and LASSO.
Contribution
It introduces new weaker conditions under which stochastic sub-gradient methods attain linear convergence, extending applicability to important problems like SVM, LASSO, and denoising.
Findings
Stochastic sub-gradient methods can achieve linear convergence rates.
Weaker conditions than traditional assumptions are sufficient for convergence.
Performance bounds are established using exponential-weighting smoothing.
Abstract
In this work and the supporting Part II, we examine the performance of stochastic sub-gradient learning strategies under weaker conditions than usually considered in the literature. The new conditions are shown to be automatically satisfied by several important cases of interest including SVM, LASSO, and Total-Variation denoising formulations. In comparison, these problems do not satisfy the traditional assumptions used in prior analyses and, therefore, conclusions derived from these earlier treatments are not directly applicable to these problems. The results in this article establish that stochastic sub-gradient strategies can attain linear convergence rates, as opposed to sub-linear rates, to the steady-state regime. A realizable exponential-weighting procedure is employed to smooth the intermediate iterates and guarantee useful performance bounds in terms of convergence rate and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and ELM · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning
MethodsSupport Vector Machine
