POND: Pessimistic-Optimistic oNline Dispatching
Xin Liu, Bin Li, Pengyi Shi, Lei Ying

TL;DR
This paper introduces POND, an online dispatching algorithm that balances pessimistic and optimistic strategies to minimize regret and constraint violations in uncertain, real-time decision-making scenarios.
Contribution
The paper presents a novel online dispatching algorithm, POND, with proven optimal regret and constraint violation bounds, applicable to unknown distribution settings.
Findings
POND achieves $O(\sqrt{T})$ regret.
POND maintains $O(1)$ constraint violation.
Experimental results show low regret and minimal violations.
Abstract
This paper considers constrained online dispatching with unknown arrival, reward and constraint distributions. We propose a novel online dispatching algorithm, named POND, standing for Pessimistic-Optimistic oNline Dispatching, which achieves regret and constraint violation. Both bounds are sharp. Our experiments on synthetic and real datasets show that POND achieves low regret with minimal constraint violations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Complexity and Algorithms in Graphs
