Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations
Abdelrhman Shaheen, Anas Badr, Ali Abohendy, Hatem Alsaadawy, Nadine Alsayad, Ehab H. El-Shazly

TL;DR
This paper reviews Google DeepMind's reinforcement learning innovations in gaming, focusing on models like AlphaGo, AlphaGo Zero, and MuZero, highlighting their key advancements, training methods, and impact on AI development.
Contribution
It provides a comprehensive review of DeepMind's latest reinforcement learning models in strategy and Atari games, emphasizing their novel approaches and technological progress.
Findings
AlphaGo mastered Go surpassing human players
AlphaGo Zero eliminated reliance on human data
MuZero learned game dynamics without explicit rules
Abstract
Reinforcement Learning (RL) has been widely used in many applications, particularly in gaming, which serves as an excellent training ground for AI models. Google DeepMind has pioneered innovations in this field, employing reinforcement learning algorithms, including model-based, model-free, and deep Q-network approaches, to create advanced AI models such as AlphaGo, AlphaGo Zero, and MuZero. AlphaGo, the initial model, integrates supervised learning and reinforcement learning to master the game of Go, surpassing professional human players. AlphaGo Zero refines this approach by eliminating reliance on human gameplay data, instead utilizing self-play for enhanced learning efficiency. MuZero further extends these advancements by learning the underlying dynamics of game environments without explicit knowledge of the rules, achieving adaptability across various games, including complex Atari…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Reinforcement Learning in Robotics · Evolutionary Algorithms and Applications
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Prioritized Experience Replay · Batch Normalization · Average Pooling · Residual Connection · Residual Block · Monte-Carlo Tree Search · Convolution · MuZero
