RSN: Randomized Subspace Newton
Robert M. Gower, Dmitry Kovalev, Felix Lieder, Peter Richt\'arik

TL;DR
The paper introduces a randomized Newton method that efficiently solves high-dimensional learning problems by using sketching techniques, offering a flexible and convergent approach suitable for large-scale applications.
Contribution
It presents a novel randomized Newton method leveraging sketching constrained directions, with a universal convergence theory applicable to various sketching techniques.
Findings
Demonstrates superior efficiency over gradient descent and full Newton methods
Provides a flexible framework for custom sketching in large-scale problems
Establishes a global linear convergence guarantee for the proposed method
Abstract
We develop a randomized Newton method capable of solving learning problems with huge dimensional feature spaces, which is a common setting in applications such as medical imaging, genomics and seismology. Our method leverages randomized sketching in a new way, by finding the Newton direction constrained to the space spanned by a random sketch. We develop a simple global linear convergence theory that holds for practically all sketching techniques, which gives the practitioners the freedom to design custom sketching approaches suitable for particular applications. We perform numerical experiments which demonstrate the efficiency of our method as compared to accelerated gradient descent and the full Newton method. Our method can be seen as a refinement and randomized extension of the results of Karimireddy, Stich, and Jaggi (2019).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms
