Proximal Algorithms for Smoothed Online Convex Optimization with Predictions
Spandan Senapati, Ashwin Shenai, Ketan Rajawat

TL;DR
This paper introduces new proximal algorithms for smoothed online convex optimization with lookahead predictions, achieving improved regret bounds and broadening problem applicability.
Contribution
It develops two novel algorithms, RHAPD and RHAPD-S, for different types of convex costs, with theoretical regret bounds and empirical validation.
Findings
Algorithms outperform gradient descent methods.
Regret bounds decay exponentially with lookahead window length.
Effective on non-smooth, smooth, and strongly convex problems.
Abstract
We consider a smoothed online convex optimization (SOCO) problem with predictions, where the learner has access to a finite lookahead window of time-varying stage costs, but suffers a switching cost for changing its actions at each stage. Based on the Alternating Proximal Gradient Descent (APGD) framework, we develop Receding Horizon Alternating Proximal Descent (RHAPD) for proximable, non-smooth and strongly convex stage costs, and RHAPD-Smooth (RHAPD-S) for non-proximable, smooth and strongly convex stage costs. In addition to outperforming gradient descent-based algorithms, while maintaining a comparable runtime complexity, our proposed algorithms also allow us to solve a wider range of problems. We provide theoretical upper bounds on the dynamic regret achieved by the proposed algorithms, which decay exponentially with the length of the lookahead window. The performance of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research · Distributed Sensor Networks and Detection Algorithms
