Learning for Edge-Weighted Online Bipartite Matching with Robustness Guarantees
Pengfei Li, Jianyi Yang, Shaolei Ren

TL;DR
This paper introduces LOMAR, a novel reinforcement learning approach for online bipartite matching that guarantees robustness and competitive performance, balancing worst-case guarantees with improved average-case results.
Contribution
The paper proposes a new online switching operation in RL-based matching, providing robustness guarantees and balancing average and worst-case performance.
Findings
LOMAR achieves $ ho$-competitiveness for any $ ho ext{ in }[0,1]$.
Empirical results show LOMAR outperforms existing baselines.
The approach effectively balances robustness and average performance.
Abstract
Many problems, such as online ad display, can be formulated as online bipartite matching. The crucial challenge lies in the nature of sequentially-revealed online item information, based on which we make irreversible matching decisions at each step. While numerous expert online algorithms have been proposed with bounded worst-case competitive ratios, they may not offer satisfactory performance in average cases. On the other hand, reinforcement learning (RL) has been applied to improve the average performance, but it lacks robustness and can perform arbitrarily poorly. In this paper, we propose a novel RL-based approach to edge-weighted online bipartite matching with robustness guarantees (LOMAR), achieving both good average-case and worst-case performance. The key novelty of LOMAR is a new online switching operation which, based on a judicious condition to hedge against future…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsOptimization and Search Problems · Advanced Bandit Algorithms Research · Auction Theory and Applications
