Multi-Armed Bandits for Minesweeper: Profiting from   Exploration-Exploitation Synergy

Igor Q. Lordeiro; Diego B. Haddad; Douglas O. Cardoso

arXiv:2007.12824·cs.LG·June 21, 2021

Multi-Armed Bandits for Minesweeper: Profiting from Exploration-Exploitation Synergy

Igor Q. Lordeiro, Diego B. Haddad, Douglas O. Cardoso

PDF

TL;DR

This paper explores using Reinforcement Learning, specifically Multi-Armed Bandit algorithms, to develop autonomous Minesweeper players, revealing promising results especially on smaller boards and providing new insights into the game's learning dynamics.

Contribution

It introduces a novel application of Multi-Armed Bandit algorithms for Minesweeper, offering a detailed analysis of the game's learning aspects and demonstrating effectiveness on small game boards.

Findings

01

Successful application on smaller boards like beginner level

02

Reinforcement Learning approach outperforms random strategies

03

Provides new insights into Minesweeper's learning dynamics

Abstract

A popular computer puzzle, the game of Minesweeper requires its human players to have a mix of both luck and strategy to succeed. Analyzing these aspects more formally, in our research we assessed the feasibility of a novel methodology based on Reinforcement Learning as an adequate approach to tackle the problem presented by this game. For this purpose we employed Multi-Armed Bandit algorithms which were carefully adapted in order to enable their use to define autonomous computational players, targeting to make the best use of some game peculiarities. After experimental evaluation, results showed that this approach was indeed successful, especially in smaller game boards, such as the standard beginner level. Despite this fact the main contribution of this work is a detailed examination of Minesweeper from a learning perspective, which led to various original insights which are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.