POND: Pessimistic-Optimistic oNline Dispatching

Xin Liu; Bin Li; Pengyi Shi; Lei Ying

arXiv:2010.09995·cs.LG·May 12, 2021·5 cites

POND: Pessimistic-Optimistic oNline Dispatching

Xin Liu, Bin Li, Pengyi Shi, Lei Ying

PDF

Open Access

TL;DR

This paper introduces POND, an online dispatching algorithm that balances pessimistic and optimistic strategies to minimize regret and constraint violations in uncertain, real-time decision-making scenarios.

Contribution

The paper presents a novel online dispatching algorithm, POND, with proven optimal regret and constraint violation bounds, applicable to unknown distribution settings.

Findings

01

POND achieves $O(\sqrt{T})$ regret.

02

POND maintains $O(1)$ constraint violation.

03

Experimental results show low regret and minimal violations.

Abstract

This paper considers constrained online dispatching with unknown arrival, reward and constraint distributions. We propose a novel online dispatching algorithm, named POND, standing for Pessimistic-Optimistic oNline Dispatching, which achieves $O (T)$ regret and $O (1)$ constraint violation. Both bounds are sharp. Our experiments on synthetic and real datasets show that POND achieves low regret with minimal constraint violations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Complexity and Algorithms in Graphs