Non-stationary Stochastic Optimization under $L_{p,q}$-Variation Measures
Xi Chen, Yining Wang, Yu-Xiang Wang

TL;DR
This paper introduces an $L_{p,q}$-variation measure for non-stationary stochastic optimization, deriving regret bounds that generalize previous work and reveal phenomena like the curse of dimensionality.
Contribution
It proposes a new $L_{p,q}$-variation functional, providing tight regret bounds for various convex function sequences and extending prior results in non-stationary optimization.
Findings
Derived upper and lower regret bounds under $L_{p,q}$-variation constraints.
Revealed the curse of dimensionality in the context of this variation measure.
Provided a cubic spline construction matching lower bounds.
Abstract
We consider a non-stationary sequential stochastic optimization problem, in which the underlying cost functions change over time under a variation budget constraint. We propose an -variation functional to quantify the change, which yields less variation for dynamic function sequences whose changes are constrained to short time periods or small subsets of input domain. Under the -variation constraint, we derive both upper and matching lower regret bounds for smooth and strongly convex function sequences, which generalize previous results in Besbes et al. (2015). Furthermore, we provide an upper bound for general convex function sequences with noisy gradient feedback, which matches the optimal rate as . Our results reveal some surprising phenomena under this general variation functional, such as the curse of dimensionality of the function domain. The key…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Sparse and Compressive Sensing Techniques · Reinforcement Learning in Robotics
