Online Inference of Constrained Optimization: Primal-Dual Optimality and Sequential Quadratic Programming
Yihang Gao, Michael K. Ng, Michael W. Mahoney, Sen Na

TL;DR
This paper introduces an online stochastic optimization method, SSQP, that efficiently solves constrained problems and provides valid statistical inference without intractable projections, demonstrating superior performance in various applications.
Contribution
The paper develops the first fully online primal-dual method with asymptotic minimax optimality for constrained stochastic optimization, avoiding intractable projections.
Findings
Achieves global almost-sure convergence.
Exhibits local asymptotic normality with optimal covariance.
Demonstrates superior empirical performance on benchmark and real data.
Abstract
We study online statistical inference for the solutions of stochastic optimization problems with equality and inequality constraints. Such problems are prevalent in statistics and machine learning, encompassing constrained -estimation, physics-informed models, safe reinforcement learning, and algorithmic fairness. We develop a stochastic sequential quadratic programming (SSQP) method to solve these problems, where the step direction is computed by sequentially performing a quadratic approximation of the objective and a linear approximation of the constraints. Despite having access to unbiased estimates of population gradients, a key challenge in constrained stochastic problems lies in dealing with the bias in the step direction. As such, we apply a momentum-style gradient moving-average technique within SSQP to debias the step. We show that our method achieves global almost-sure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference
