OLIVAW: Mastering Othello without Human Knowledge, nor a Fortune

Antonio Norelli; Alessandro Panconesi

arXiv:2103.17228·cs.LG·March 7, 2022

OLIVAW: Mastering Othello without Human Knowledge, nor a Fortune

Antonio Norelli, Alessandro Panconesi

PDF

Open Access

TL;DR

OLIVAW demonstrates that applying AlphaGo Zero's reinforcement learning paradigm to Othello enables the creation of a highly competent AI using only affordable hardware and free cloud resources, surpassing traditional engines and human players.

Contribution

This work adapts AlphaGo Zero's approach to Othello, introducing specific improvements to accelerate learning and achieve high-level play without human knowledge or extensive resources.

Findings

01

OLIVAW outperforms the strongest open-source Othello engine Edax.

02

OLIVAW defeats top human players in matches.

03

The approach requires only commodity hardware and free cloud services.

Abstract

We introduce OLIVAW, an AI Othello player adopting the design principles of the famous AlphaGo programs. The main motivation behind OLIVAW was to attain exceptional competence in a non-trivial board game at a tiny fraction of the cost of its illustrious predecessors. In this paper, we show how the AlphaGo Zero's paradigm can be successfully applied to the popular game of Othello using only commodity hardware and free cloud services. While being simpler than Chess or Go, Othello maintains a considerable search space and difficulty in evaluating board positions. To achieve this result, OLIVAW implements some improvements inspired by recent works to accelerate the standard AlphaGo Zero learning process. The main modification implies doubling the positions collected per game during the training phase, by including also positions not played but largely explored by the agent. We tested the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games