A better convergence analysis of the block coordinate descent method for large scale machine learning

Ziqiang Shi; Rujie Liu

arXiv:1608.04826·math.OC·August 18, 2016·2 cites

TL;DR

This paper gives a tighter convergence analysis of the block coordinate descent (BCD) method for large-scale smooth convex optimization, yielding a new bound on its complexity that is smaller than the best previously known.

Contribution

It derives what is, to the authors' knowledge, the smallest known bound on the information-based complexity of BCD, using the Performance Estimation Problem (PEP) technique.

Findings

01. The new bound is $16 p^{3}$ times smaller than the best previously known bound.

02. Numerical tests confirm the theoretical analysis.

03. The analysis sharpens the understanding of BCD convergence behavior.

Abstract

This paper considers the unconstrained minimization of large-scale smooth convex functions with block-coordinate-wise Lipschitz continuous gradients. The block coordinate descent (BCD) method is among the first optimization schemes suggested for solving such problems \cite{nesterov2012efficiency}. We obtain a new bound on the information-based complexity of the BCD method that is $16 p^{3}$ times smaller than the best previously known bound and is, to the best of our knowledge, the smallest currently available. The analysis is based on the Performance Estimation Problem (PEP), an effective technique recently proposed by Drori and Teboulle \cite{drori2012performance} for analyzing the performance of first-order black-box optimization methods. Numerical tests confirm our analysis.
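To make the setting in the abstract concrete, here is a minimal sketch of a block coordinate descent iteration on a small convex quadratic. Several things are assumptions for illustration and are not taken from the paper: the cyclic block order, the test function $f(x) = \tfrac{1}{2} x^{\top} A x - b^{\top} x$, the particular matrix `A`, and the use of a maximum-absolute-row-sum bound as each block's Lipschitz constant. The PEP analysis itself is not reproduced here.

```python
# Illustrative sketch only: cyclic BCD with step size 1/L_i per block on
# f(x) = 0.5 * x^T A x - b^T x. The problem data below are hypothetical.

def block_lipschitz(A, block):
    # Max absolute row sum of the diagonal block A[block, block]. This upper
    # bounds its largest eigenvalue, so it is a valid (possibly loose)
    # block-wise Lipschitz constant for the gradient.
    return max(sum(abs(A[i][j]) for j in block) for i in block)

def gradient(A, b, x):
    n = len(x)
    return [sum(A[i][j] * x[j] for j in range(n)) - b[i] for i in range(n)]

def objective(A, b, x):
    n = len(x)
    quad = sum(x[i] * A[i][j] * x[j] for i in range(n) for j in range(n))
    return 0.5 * quad - sum(b[i] * x[i] for i in range(n))

def cyclic_bcd(A, b, blocks, x0, sweeps):
    x = list(x0)
    L = [block_lipschitz(A, blk) for blk in blocks]
    history = [objective(A, b, x)]
    for _ in range(sweeps):
        for blk, L_i in zip(blocks, L):
            # The full gradient is recomputed for simplicity; only the
            # entries belonging to the current block are used.
            g = gradient(A, b, x)
            for i in blk:
                x[i] -= g[i] / L_i
        history.append(objective(A, b, x))
    return x, history

# Hypothetical symmetric, strictly diagonally dominant (hence positive
# definite) matrix, split into two blocks of two coordinates each.
A = [[4.0, 1.0, 0.0, 0.0],
     [1.0, 3.0, 0.0, 1.0],
     [0.0, 0.0, 2.0, 0.5],
     [0.0, 1.0, 0.5, 3.0]]
b = [1.0, 2.0, 3.0, 4.0]
blocks = [[0, 1], [2, 3]]

x, history = cyclic_bcd(A, b, blocks, [0.0] * 4, sweeps=300)
residual = max(abs(g_i) for g_i in gradient(A, b, x))
```

With a step of `1 / L_i` on each block, every block update cannot increase the objective, so `history` is non-increasing; that per-step decrease is the kind of quantity a worst-case complexity bound controls.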

Taxonomy

Topics: Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Machine Learning and Algorithms