An Evolutionary Framework for Connect-4 as Test-Bed for Comparison of Advanced Minimax, Q-Learning and MCTS
Henry Taylor, Leonardo Stella

TL;DR
This paper introduces an evolutionary framework to systematically compare advanced versions of RL, Minimax, and MCTS algorithms in Connect-4, providing insights into their relative performance and decision-making efficiency.
Contribution
It develops a novel evaluation method called the Evolutionary Tournament and compares advanced algorithms against standard ones in Connect-4.
Findings
MCTS achieves the highest win percentage.
Minimax outperforms standard versions.
Q-Learning is the fastest decision-maker.
Abstract
A major challenge in decision making domains with large state spaces is to effectively select actions which maximize utility. In recent years, approaches such as reinforcement learning (RL) and search algorithms have been successful to tackle this issue, despite their differences. RL defines a learning framework that an agent explores and interacts with. Search algorithms provide a formalism to search for a solution. However, it is often difficult to evaluate the performances of such approaches in a practical way. Motivated by this problem, we focus on one game domain, i.e., Connect-4, and develop a novel evolutionary framework to evaluate three classes of algorithms: RL, Minimax and Monte Carlo tree search (MCTS). The contribution of this paper is threefold: i) we implement advanced versions of these algorithms and provide a systematic comparison with their standard counterpart, ii) we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Transformation in Industry
MethodsFocus · Q-Learning
