Smoothed Online Combinatorial Optimization Using Imperfect Predictions

Kai Wang; Zhao Song; Georgios Theocharous; Sridhar Mahadevan

arXiv:2204.10979·cs.LG·January 18, 2023

Smoothed Online Combinatorial Optimization Using Imperfect Predictions

Kai Wang, Zhao Song, Georgios Theocharous, Sridhar Mahadevan

PDF

Open Access 1 Video

TL;DR

This paper develops an online optimization algorithm that leverages imperfect predictions to reduce regret in combinatorial decision-making, balancing uncertainty and switching costs.

Contribution

It introduces a planning-based approach that accounts for predictive uncertainty, providing theoretical regret bounds and demonstrating empirical improvements.

Findings

01

Regret depends on predictive uncertainty and switching costs.

02

Choosing an optimal planning window balances uncertainty and switching.

03

Empirical results show significant regret reduction over baselines.

Abstract

Smoothed online combinatorial optimization considers a learner who repeatedly chooses a combinatorial decision to minimize an unknown changing cost function with a penalty on switching decisions in consecutive rounds. We study smoothed online combinatorial optimization problems when an imperfect predictive model is available, where the model can forecast the future cost functions with uncertainty. We show that using predictions to plan for a finite time horizon leads to regret dependent on the total predictive uncertainty and an additional switching cost. This observation suggests choosing a suitable planning window to balance between uncertainty and switching cost, which leads to an online algorithm with guarantees on the upper and lower bounds of the cumulative regret. Empirically, our algorithm shows a significant improvement in cumulative regret compared to other baselines in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Smoothed Online Combinatorial Optimization Using Imperfect Predictions· underline

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Data Stream Mining Techniques