Projected Stochastic Momentum Methods for Nonlinear Equality-Constrained Optimization for Machine Learning
Qi Wang, Christian Piermarini, Yunlang Zhu, Frank E. Curtis

TL;DR
This paper introduces two stochastic momentum algorithms extended for nonlinear equality-constrained optimization in machine learning, providing convergence guarantees and demonstrating practical benefits over regularization methods through extensive experiments.
Contribution
It develops and analyzes the first stochastic momentum methods tailored for nonlinear equality-constrained problems, with projected gradient-based momentum implementation.
Findings
Algorithms achieve convergence guarantees comparable to unconstrained methods.
Numerical experiments show improved performance over regularization-based approaches.
Constrained optimization offers practical advantages in supervised learning tasks.
Abstract
Two algorithms are proposed, analyzed, and tested for solving continuous optimization problems with nonlinear equality constraints. Each is an extension of a stochastic momentum-based method from the unconstrained setting to the setting of a stochastic Newton-SQP-type algorithm for solving equality-constrained problems. One is an extension of the heavy-ball method and the other is an extension of the Adam optimization method. Convergence guarantees for the algorithms for the constrained setting are provided that are on par with state-of-the-art guarantees for their unconstrained counterparts. A critical feature of each extension is that the momentum terms are implemented with projected gradient estimates, rather than with the gradient estimates themselves. The significant practical effect of this choice is seen in an extensive set of numerical experiments on solving informed supervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference · Advanced Optimization Algorithms Research
