Projected Stochastic Momentum Methods for Nonlinear Equality-Constrained Optimization for Machine Learning

Qi Wang; Christian Piermarini; Yunlang Zhu; Frank E. Curtis

arXiv:2601.11795·math.OC·January 21, 2026

Projected Stochastic Momentum Methods for Nonlinear Equality-Constrained Optimization for Machine Learning

Qi Wang, Christian Piermarini, Yunlang Zhu, Frank E. Curtis

PDF

Open Access

TL;DR

This paper introduces two stochastic momentum algorithms extended for nonlinear equality-constrained optimization in machine learning, providing convergence guarantees and demonstrating practical benefits over regularization methods through extensive experiments.

Contribution

It develops and analyzes the first stochastic momentum methods tailored for nonlinear equality-constrained problems, with projected gradient-based momentum implementation.

Findings

01

Algorithms achieve convergence guarantees comparable to unconstrained methods.

02

Numerical experiments show improved performance over regularization-based approaches.

03

Constrained optimization offers practical advantages in supervised learning tasks.

Abstract

Two algorithms are proposed, analyzed, and tested for solving continuous optimization problems with nonlinear equality constraints. Each is an extension of a stochastic momentum-based method from the unconstrained setting to the setting of a stochastic Newton-SQP-type algorithm for solving equality-constrained problems. One is an extension of the heavy-ball method and the other is an extension of the Adam optimization method. Convergence guarantees for the algorithms for the constrained setting are provided that are on par with state-of-the-art guarantees for their unconstrained counterparts. A critical feature of each extension is that the momentum terms are implemented with projected gradient estimates, rather than with the gradient estimates themselves. The significant practical effect of this choice is seen in an extensive set of numerical experiments on solving informed supervised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference · Advanced Optimization Algorithms Research