Optimal Diagonal Preconditioning
Zhaonan Qu, Wenzhi Gao, Oliver Hinder, Yinyu Ye, Zhengyuan Zhou

TL;DR
This paper develops algorithms for optimal diagonal preconditioning to maximize condition number reduction, providing theoretical guarantees and demonstrating significant practical improvements over heuristic methods.
Contribution
It introduces a quasi-convex reformulation and efficient algorithms for optimal diagonal preconditioning, including specialized solvers for one-sided problems and extensive empirical validation.
Findings
Optimal diagonal preconditioners significantly reduce condition numbers.
The proposed algorithms outperform heuristic preconditioners in speed and effectiveness.
Practical methods scale to large matrices with up to 200,000 size.
Abstract
Preconditioning has long been a staple technique in optimization, often applied to reduce the condition number of a matrix and speed up the convergence of algorithms. Although there are many popular preconditioning techniques in practice, most lack guarantees on reductions in condition number. Moreover, the degree to which we can improve over existing heuristic preconditioners remains an important practical question. In this paper, we study the problem of optimal diagonal preconditioning that achieves maximal reduction in the condition number of any full-rank matrix by scaling its rows and/or columns. We first reformulate the problem as a quasi-convex problem and provide a simple algorithm based on bisection. Then we develop an interior point algorithm with iteration complexity, where each iteration consists of a Newton update based on the Nesterov-Todd direction.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Matrix Theory and Algorithms · Stochastic Gradient Optimization Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
