Combinatorial Bandits for Sequential Learning in Colonel Blotto Games

Dong Quan Vu; Patrick Loiseau (MPI-SWS; POLARIS); Alonso Silva

arXiv:1909.04912·cs.GT·September 12, 2019

Combinatorial Bandits for Sequential Learning in Colonel Blotto Games

Dong Quan Vu, Patrick Loiseau (MPI-SWS, POLARIS), Alonso Silva

PDF

1 Repo

TL;DR

This paper introduces a new regret-minimization approach for the Colonel Blotto game under incomplete information, utilizing combinatorial bandits and dynamic programming to improve computational efficiency and regret bounds.

Contribution

It develops an efficient algorithm for sequential resource allocation in Colonel Blotto games using combinatorial bandits and optimized exploration strategies.

Findings

01

The proposed algorithm achieves sub-linear regret in simulations.

02

Dynamic programming significantly reduces computational complexity.

03

Optimized exploration improves practical regret performance.

Abstract

The Colonel Blotto game is a renowned resource allocation problem with a long-standing literature in game theory (almost 100 years). However, its scope of application is still restricted by the lack of studies on the incomplete-information situations where a learning model is needed. In this work, we propose and study a regret-minimization model where a learner repeatedly plays the Colonel Blotto game against several adversaries. At each stage, the learner distributes her budget of resources on a fixed number of battlefields to maximize the aggregate value of battlefields she wins; each battlefield being won if there is no adversary that has higher allocation. We focus on the bandit feedback setting. We first show that it can be modeled as a path planning problem. It is then possible to use the classical COMBAND algorithm to guarantee a sub-linear regret in terms of time horizon, but…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dongquan11/BanditColonelBlotto
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.