Learning for Edge-Weighted Online Bipartite Matching with Robustness Guarantees

Pengfei Li; Jianyi Yang; Shaolei Ren

arXiv:2306.00172 · cs.LG · June 2, 2023 · 1 citation

TL;DR

This paper introduces LOMAR, a reinforcement learning approach to edge-weighted online bipartite matching that provides worst-case robustness guarantees while improving average-case performance.

Contribution

The paper proposes a new online switching operation in RL-based matching, providing robustness guarantees and balancing average and worst-case performance.

Findings

01

LOMAR achieves $\rho$-competitiveness for any $\rho \in [0,1]$.

02

Empirical results show LOMAR outperforms existing baselines.

03

The approach effectively balances robustness and average performance.
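
For reference, one common way to write a $\rho$-competitiveness requirement against a baseline (expert) online algorithm is sketched below. This summary does not spell out the definition, so the notation is an assumption: $R^{\text{LOMAR}}(I)$ and $R^{\pi}(I)$ denote the total matched weight of LOMAR and of the expert $\pi$ on an instance $I$, and $B \ge 0$ is a hypothetical slack term.

$$
R^{\text{LOMAR}}(I) \;\ge\; \rho \, R^{\pi}(I) - B \quad \text{for every instance } I, \qquad \rho \in [0,1].
$$

Under this reading, $\rho = 0$ imposes no constraint on the learned policy, while $\rho = 1$ with $B = 0$ requires matching the expert's reward on every instance.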

Abstract

Many problems, such as online ad display, can be formulated as online bipartite matching. The crucial challenge lies in the nature of sequentially-revealed online item information, based on which we make irreversible matching decisions at each step. While numerous expert online algorithms have been proposed with bounded worst-case competitive ratios, they may not offer satisfactory performance in average cases. On the other hand, reinforcement learning (RL) has been applied to improve the average performance, but it lacks robustness and can perform arbitrarily poorly. In this paper, we propose a novel RL-based approach to edge-weighted online bipartite matching with robustness guarantees (LOMAR), achieving both good average-case and worst-case performance. The key novelty of LOMAR is a new online switching operation which, based on a judicious condition to hedge against future…
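
The switching idea described in the abstract can be illustrated with a deliberately simplified sketch. It is not the authors' algorithm: the check below only compares running rewards and omits the hedging term against future arrivals that the abstract alludes to, so it does not carry the paper's guarantee. The names `greedy_expert`, `worst_free_policy`, `run_with_switching`, and the `slack` parameter are hypothetical, and each offline item is assumed to have capacity one.

```python
from typing import Callable, Optional, Sequence, Set, Tuple

Weights = Sequence[float]  # weights[u] = edge weight to offline item u (0 means no edge)
Policy = Callable[[Weights, Set[int]], Optional[int]]


def greedy_expert(weights: Weights, used: Set[int]) -> Optional[int]:
    """Hypothetical expert: take the free offline item with the largest positive weight."""
    free = [u for u in range(len(weights)) if u not in used and weights[u] > 0]
    return max(free, key=lambda u: weights[u]) if free else None


def worst_free_policy(weights: Weights, used: Set[int]) -> Optional[int]:
    """Stand-in for a poorly trained policy: take the free item with the smallest positive weight."""
    free = [u for u in range(len(weights)) if u not in used and weights[u] > 0]
    return min(free, key=lambda u: weights[u]) if free else None


def run_with_switching(
    arrivals: Sequence[Weights], policy: Policy, rho: float, slack: float = 0.0
) -> Tuple[float, float]:
    """Return (reward of the switched algorithm, reward of the expert's virtual run)."""
    used_alg: Set[int] = set()
    used_exp: Set[int] = set()
    reward_alg = 0.0
    reward_exp = 0.0
    for w in arrivals:
        # Advance the expert's own (virtual) run on the same arrival.
        e = greedy_expert(w, used_exp)
        if e is not None:
            used_exp.add(e)
            reward_exp += w[e]

        # Ask the learned policy for a proposal and validate it.
        a = policy(w, used_alg)
        valid = a is not None and a not in used_alg and w[a] > 0
        gain = w[a] if valid else 0.0

        # Simplified check (assumption): follow the policy only if the running
        # reward stays at least rho times the expert's, up to the slack.
        if not (valid and reward_alg + gain >= rho * reward_exp - slack):
            # Otherwise switch to the expert's choice if it is still free for us,
            # falling back to a greedy choice on our own state.
            a = e if (e is not None and e not in used_alg) else greedy_expert(w, used_alg)

        if a is not None:
            used_alg.add(a)
            reward_alg += w[a]
    return reward_alg, reward_exp
```

With `rho=0.0` the sketch always trusts the policy, and with `rho=1.0` it overrides any proposal that would drop the running reward below the expert's. Intermediate values trade off the two, which mirrors the balance between average-case and worst-case performance that the abstract describes.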

Code & Models

Repositories

ren-research/lomar
PyTorch · Official

Videos

Learning for Edge-Weighted Online Bipartite Matching with Robustness Guarantees · SlidesLive

Taxonomy

Topics: Optimization and Search Problems · Advanced Bandit Algorithms Research · Auction Theory and Applications