Stability and Convergence Trade-off of Iterative Optimization Algorithms
Yuansi Chen, Chi Jin, Bin Yu

TL;DR
This paper explores the fundamental trade-off between convergence speed and stability in iterative machine learning algorithms, establishing bounds that link these two aspects and demonstrating their implications for generalization performance.
Contribution
It introduces a theoretical framework connecting convergence and stability, deriving new lower bounds and providing insights into the performance limits of popular algorithms.
Findings
Faster convergence algorithms tend to be less stable.
Derived lower bounds match known convergence upper bounds for gradient methods.
Numerical experiments confirm the tightness of the stability bounds.
Abstract
The overall performance or expected excess risk of an iterative machine learning algorithm can be decomposed into training error and generalization error. While the former is controlled by its convergence analysis, the latter can be tightly handled by algorithmic stability. The machine learning community has a rich history investigating convergence and stability separately. However, the question about the trade-off between these two quantities remains open. In this paper, we show that for any iterative algorithm at any iteration, the overall performance is lower bounded by the minimax statistical error over an appropriately chosen loss function class. This implies an important trade-off between convergence and stability of the algorithm -- a faster converging algorithm has to be less stable, and vice versa. As a direct consequence of this fundamental tradeoff, new convergence lower…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Stability and Convergence Trade-Off of Iterative Optimization Algorithms· youtube
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and ELM
