Online Fair Revenue Maximizing Cake Division with Non-Contiguous Pieces in Adversarial Bandits
Mohammad Ghodsi, Amirmahdi Mirfakhar

TL;DR
This paper introduces an online algorithm for fair revenue-maximizing cake division with non-contiguous pieces using adversarial bandits, achieving low regret in fairness and revenue.
Contribution
It extends cake-cutting to an online setting with non-contiguous pieces and applies adversarial bandit algorithms, notably EXP_3, for fairness and revenue guarantees.
Findings
Achieves sub-linear fairness and revenue regret.
Demonstrates effectiveness of adversarial bandits in resource allocation.
Provides theoretical bounds for online cake division.
Abstract
The classic cake-cutting problem provides a model for addressing the fair and efficient allocation of a divisible, heterogeneous resource among agents with distinct preferences. Focusing on a standard formulation of cake cutting, in which each agent must receive a contiguous piece of the cake in an offline setting, this work instead focuses on online allocating non-contiguous pieces of cake among agents and establishes algorithmic results for fairness measures. In this regard, we made use of classic adversarial multi-armed bandits to achieve sub-linear Fairness and Revenue Regret at the same time. Adversarial bandits are powerful tools to model the adversarial reinforcement learning environments, that provide strong upper-bounds for regret of learning with just observing one action's reward in each step by applying smart trade-off between exploration and exploitation. This work studies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Reinforcement Learning in Robotics
