Randomized Subspace Derivative-Free Optimization with Quadratic Models   and Second-Order Convergence

Coralia Cartis; Lindon Roberts

arXiv:2412.14431·math.OC·December 20, 2024

Randomized Subspace Derivative-Free Optimization with Quadratic Models and Second-Order Convergence

Coralia Cartis, Lindon Roberts

PDF

Open Access

TL;DR

This paper introduces a new derivative-free optimization method that uses random subspaces and quadratic models, achieving second-order convergence with improved dimension dependence, enabling larger problem solving.

Contribution

It provides the first worst-case complexity bounds for subspace DFO methods with second-order convergence and introduces a practical quadratic interpolation approach for large-scale problems.

Findings

01

First worst-case complexity bounds for subspace DFO with second-order convergence

02

Significantly improved dimension dependence over full-space methods

03

Able to solve larger problems efficiently while maintaining performance on medium-scale problems

Abstract

We consider model-based derivative-free optimization (DFO) for large-scale problems, based on iterative minimization in random subspaces. We provide the first worst-case complexity bound for such methods for convergence to approximate second-order critical points, and show that these bounds have significantly improved dimension dependence compared to standard full-space methods, provided low accuracy solutions are desired and/or the problem has low effective rank. We also introduce a practical subspace model-based method suitable for general objective minimization, based on iterative quadratic interpolation in subspaces, and show that it can solve significantly larger problems than state-of-the-art full-space methods, while also having comparable performance on medium-scale problems when allowed to use full-dimension subspaces.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMatrix Theory and Algorithms · Iterative Learning Control Systems · Stochastic Gradient Optimization Techniques