Cubic regularized subspace Newton for non-convex optimization
Jim Zhao, Aurelien Lucchi, Nikita Doikov

TL;DR
This paper introduces a randomized coordinate second-order method called SSCN that uses cubic regularization in random subspaces to efficiently optimize high-dimensional non-convex functions, with proven convergence guarantees and practical speed-ups.
Contribution
It proposes a novel subspace cubic regularization method with theoretical convergence guarantees and adaptive sampling, improving efficiency in high-dimensional non-convex optimization.
Findings
Convergence guarantees for non-convex functions with subspace methods.
Complexity matches $ ilde{O}( ext{epsilon}^{-3/2})$ rate for increasing subspace size.
Experimental results show significant speed-ups over first-order methods.
Abstract
This paper addresses the optimization problem of minimizing non-convex continuous functions, which is relevant in the context of high-dimensional machine learning applications characterized by over-parametrization. We analyze a randomized coordinate second-order method named SSCN which can be interpreted as applying cubic regularization in random subspaces. This approach effectively reduces the computational complexity associated with utilizing second-order information, rendering it applicable in higher-dimensional scenarios. Theoretically, we establish convergence guarantees for non-convex functions, with interpolating rates for arbitrary subspace sizes and allowing inexact curvature estimation. When increasing subspace size, our complexity matches of the cubic regularization (CR) rate. Additionally, we propose an adaptive sampling scheme ensuring exact…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Sparse and Compressive Sensing Techniques · Optimization and Variational Analysis
