Some Unified Theory for Variance Reduced Prox-Linear Methods

Yue Wu; Benjamin Grimmer

arXiv:2412.15008·math.OC·October 16, 2025

TL;DR

This paper develops a unified convergence theory for variance-reduced prox-linear methods applied to nonconvex, nonsmooth composite optimization problems, improving theoretical guarantees and accommodating inexact computations.

Contribution

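The paper introduces a unified convergence framework that consolidates the technical conditions for variance-reduced prox-linear algorithms into a single assumption, strengthens their guarantees to hold with high probability, and extends them to inexact proximal computations.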

Findings

1. Operator norm bounds suffice for convergence analysis

2. State-of-the-art high probability guarantees achieved

3. Inexact proximal computations are supported
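
To give a feel for why the first finding matters, the sketch below compares the operator (spectral) norm and the Frobenius norm of a matrix standing in for a Jacobian. The helper names `frobenius_norm` and `operator_norm` are hypothetical, and the power iteration is only a simple illustrative estimator, not anything taken from the paper. For the 100 x 100 identity matrix the operator norm is 1 while the Frobenius norm is 10, so a bound stated in Frobenius norm can be larger by a factor of up to the square root of the rank.

```python
import math


def frobenius_norm(matrix):
    """Square root of the sum of squared entries."""
    return math.sqrt(sum(entry * entry for row in matrix for entry in row))


def operator_norm(matrix, iterations=200):
    """Estimate the largest singular value by power iteration on J^T J.

    Illustrative only: it assumes the uniform start vector is not
    orthogonal to the top right singular vector of the matrix.
    """
    n_cols = len(matrix[0])
    v = [1.0 / math.sqrt(n_cols)] * n_cols  # unit-norm start vector
    sigma = 0.0
    for _ in range(iterations):
        # u = J v, so ||u|| estimates the top singular value since ||v|| = 1
        u = [sum(a * b for a, b in zip(row, v)) for row in matrix]
        sigma = math.sqrt(sum(x * x for x in u))
        # w = J^T u, then renormalize to get the next iterate
        w = [sum(row[j] * u_i for row, u_i in zip(matrix, u)) for j in range(n_cols)]
        w_norm = math.sqrt(sum(x * x for x in w))
        if w_norm == 0.0:
            break
        v = [x / w_norm for x in w]
    return sigma


# Stand-in "Jacobian": the 100 x 100 identity matrix.
n = 100
J = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
op = operator_norm(J)
fro = frobenius_norm(J)
print(f"operator norm ~ {op:.3f}, Frobenius norm = {fro:.3f}")
```

The identity matrix is the extreme case; for a rank-one matrix the two norms coincide, so the size of the gap depends on the spectrum of the Jacobian.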

Abstract

This work considers the nonconvex, nonsmooth problem of minimizing a composite objective of the form $f(g(x)) + h(x)$, where the inner mapping $g$ is a smooth finite summation or expectation amenable to variance reduction. In such settings, prox-linear methods can enjoy variance-reduced speed-ups despite the nonsmoothness. We provide a unified convergence theory applicable to a wide range of common variance-reduced vector and Jacobian constructions. All the technical conditions we require of variance-reduced methods are summarized in a single unified assumption. Our theory (i) only requires operator norm bounds on Jacobians (whereas prior works used potentially much larger Frobenius norms), (ii) provides state-of-the-art high probability guarantees, and (iii) allows inexactness in proximal computations.
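
For orientation, the classical deterministic prox-linear step for this objective linearizes only the inner map $g$ and leaves $f$ and $h$ intact. The step size $t > 0$ is notation assumed here for illustration and does not appear in the abstract:

$$x_{k+1} \in \operatorname*{argmin}_{x} \; f\big(g(x_k) + \nabla g(x_k)(x - x_k)\big) + h(x) + \frac{1}{2t}\,\|x - x_k\|^2$$

A variance-reduced variant would, roughly, replace $g(x_k)$ and $\nabla g(x_k)$ with stochastic estimates, written here as $\tilde{g}_k$ and $\tilde{J}_k$ (again assumed notation), and an inexact variant would solve the subproblem only approximately. The paper's precise updates, estimator constructions, and inexactness criteria may differ from this sketch.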

Taxonomy

Topics: Advanced Optimization Algorithms Research · Matrix Theory and Algorithms · Control Systems and Identification