From Imitation to Interaction: Mastering Game of Schnapsen with Shallow Reinforcement Learning
J\'an Kla\v{c}an, Sizhong Zhang

TL;DR
This paper demonstrates that shallow neural network agents trained with reinforcement learning can effectively master the card game Schnapsen and outperform strong search-based baselines, especially when combined with deeper lookahead.
Contribution
It introduces a reinforcement learning approach for shallow neural networks to master Schnapsen, outperforming supervised imitation and strong search-based agents.
Findings
Reinforcement learning agents outperform supervised imitation agents.
Combining learned value functions with deeper lookahead improves performance.
Optimal training sample size varies depending on the setting.
Abstract
This paper investigates whether shallow neural network agents can master the card game Schnapsen and challenge a strong search-based baseline, RdeepBot, which uses Monte Carlo sampling and lookahead search. Guided by a progressively more complex experimental design, we first evaluate a supervised learning agent (MLPBot) trained on replay data and then a reinforcement learning agent (RLBot) with the same shallow architecture trained through asynchronous Monte Carlo updates and experience replay. The results show that supervised imitation does not generalize well enough to defeat strong RdeepBot opponents, whereas reinforcement learning produces substantially stronger agents. In the setting that focuses on the depth parameter of RdeepBot, the best performance is achieved when the learned value function is combined with deeper lookahead during gameplay, allowing RLBot to achieve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
