Stability and Generalization for Randomized Coordinate Descent

Puyu Wang; Liang Wu; Yunwen Lei

arXiv:2108.07414·cs.LG·August 18, 2021

Stability and Generalization for Randomized Coordinate Descent

Puyu Wang, Liang Wu, Yunwen Lei

PDF

Open Access

TL;DR

This paper introduces the first stability-based generalization analysis for randomized coordinate descent (RCD), demonstrating its superior stability over stochastic gradient descent and providing guidelines for early stopping to optimize generalization.

Contribution

It pioneers the stability analysis of RCD for convex and strongly convex objectives, linking stability to generalization bounds and early stopping strategies.

Findings

01

RCD has better stability than stochastic gradient descent.

02

Optimal early stopping can improve generalization.

03

Provides theoretical bounds for RCD's generalization performance.

Abstract

Randomized coordinate descent (RCD) is a popular optimization algorithm with wide applications in solving various machine learning problems, which motivates a lot of theoretical analysis on its convergence behavior. As a comparison, there is no work studying how the models trained by RCD would generalize to test examples. In this paper, we initialize the generalization analysis of RCD by leveraging the powerful tool of algorithmic stability. We establish argument stability bounds of RCD for both convex and strongly convex objectives, from which we develop optimal generalization bounds by showing how to early-stop the algorithm to tradeoff the estimation and optimization. Our analysis shows that RCD enjoys better stability as compared to stochastic gradient descent.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Face and Expression Recognition