Performance Limits of Stochastic Sub-Gradient Learning, Part I: Single   Agent Case

Bicheng Ying; Ali H. Sayed

arXiv:1511.07902·stat.ML·April 24, 2017

Performance Limits of Stochastic Sub-Gradient Learning, Part I: Single Agent Case

Bicheng Ying, Ali H. Sayed

PDF

Open Access

TL;DR

This paper analyzes the performance limits of stochastic sub-gradient methods in single-agent learning, showing they can achieve linear convergence under weaker conditions than traditionally assumed, with practical implications for problems like SVM and LASSO.

Contribution

It introduces new weaker conditions under which stochastic sub-gradient methods attain linear convergence, extending applicability to important problems like SVM, LASSO, and denoising.

Findings

01

Stochastic sub-gradient methods can achieve linear convergence rates.

02

Weaker conditions than traditional assumptions are sufficient for convergence.

03

Performance bounds are established using exponential-weighting smoothing.

Abstract

In this work and the supporting Part II, we examine the performance of stochastic sub-gradient learning strategies under weaker conditions than usually considered in the literature. The new conditions are shown to be automatically satisfied by several important cases of interest including SVM, LASSO, and Total-Variation denoising formulations. In comparison, these problems do not satisfy the traditional assumptions used in prior analyses and, therefore, conclusions derived from these earlier treatments are not directly applicable to these problems. The results in this article establish that stochastic sub-gradient strategies can attain linear convergence rates, as opposed to sub-linear rates, to the steady-state regime. A realizable exponential-weighting procedure is employed to smooth the intermediate iterates and guarantee useful performance bounds in terms of convergence rate and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and ELM · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning

MethodsSupport Vector Machine