Optimization via First-Order Switching Methods: Skew-Symmetric Dynamics and Optimistic Discretization
Antesh Upadhyay, Sang Bin Moon, and Abolfazl Hashemi

TL;DR
This paper investigates the convergence properties of the Switching Gradient Method (SGM) for constrained optimization, revealing limitations under smoothness and proposing optimistic and soft switching variants for improved performance.
Contribution
The paper provides a continuous-time analysis of SGM's limitations under smoothness and introduces optimistic and soft switching methods to enhance convergence.
Findings
SGM does not automatically benefit from smoothness for faster rates.
Continuous-time analysis reveals fundamental limitations of SGM's dynamics.
Optimistic and soft switching methods can potentially improve convergence rates.
Abstract
Large-scale constrained optimization problems are at the core of many tasks in control, signal processing, and machine learning. Notably, problems with functional constraints arise when, beyond a performance{\nobreakdash-}centric goal (e.g., minimizing the empirical loss), one desires to satisfy other requirements such as robustness, fairness, etc. A simple method for such problems, which remarkably achieves optimal rates for non-smooth, convex, strongly convex, and weakly convex functions under first-order oracle, is Switching Gradient Method (SGM): in each iteration depending on a predetermined constraint violation tolerance, use the gradient of objective or the constraint as the update vector. While the performance of SGM is well-understood for non-smooth functions and in fact matches its unconstrained counterpart, i.e., Gradient Descent (GD), less is formally established about its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum chaos and dynamical systems
