Smoothed Online Combinatorial Optimization Using Imperfect Predictions
Kai Wang, Zhao Song, Georgios Theocharous, Sridhar Mahadevan

TL;DR
This paper develops an online optimization algorithm that leverages imperfect predictions to reduce regret in combinatorial decision-making, balancing uncertainty and switching costs.
Contribution
It introduces a planning-based approach that accounts for predictive uncertainty, providing theoretical regret bounds and demonstrating empirical improvements.
Findings
Regret depends on predictive uncertainty and switching costs.
Choosing an optimal planning window balances uncertainty and switching.
Empirical results show significant regret reduction over baselines.
Abstract
Smoothed online combinatorial optimization considers a learner who repeatedly chooses a combinatorial decision to minimize an unknown changing cost function with a penalty on switching decisions in consecutive rounds. We study smoothed online combinatorial optimization problems when an imperfect predictive model is available, where the model can forecast the future cost functions with uncertainty. We show that using predictions to plan for a finite time horizon leads to regret dependent on the total predictive uncertainty and an additional switching cost. This observation suggests choosing a suitable planning window to balance between uncertainty and switching cost, which leads to an online algorithm with guarantees on the upper and lower bounds of the cumulative regret. Empirically, our algorithm shows a significant improvement in cumulative regret compared to other baselines in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Data Stream Mining Techniques
