Accelerating Optimization via Differentiable Stopping Time
Zhonglin Xie, Yiman Fong, Haoran Yuan, Zaiwen Wen

TL;DR
This paper introduces a differentiable stopping time framework that allows for gradient-based optimization of the time to reach a target loss, enabling faster and more efficient hyperparameter tuning and optimization in machine learning.
Contribution
It proposes a novel differentiable stopping time based on differential equations, facilitating gradient-based optimization for timing in machine learning algorithms.
Findings
Superior performance in diverse experiments
Effective for online hyperparameter tuning
Enables differentiable optimization of stopping times
Abstract
Optimization is an important module of modern machine learning applications. Tremendous efforts have been made to accelerate optimization algorithms. A common formulation is achieving a lower loss at a given time. This enables a differentiable framework with respect to the algorithm hyperparameters. In contrast, its dual, minimizing the time to reach a target loss, is believed to be non-differentiable, as the time is not differentiable. As a result, it usually serves as a conceptual framework or is optimized using zeroth-order methods. To address this limitation, we propose a differentiable stopping time and theoretically justify it based on differential equations. An efficient algorithm is designed to backpropagate through it. As a result, the proposed differentiable stopping time enables a new differentiable formulation for accelerating algorithms. We further discuss its applications,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Advanced Optimization Algorithms Research · Optimization and Search Problems
