From Imitation to Interaction: Mastering Game of Schnapsen with Shallow Reinforcement Learning

J\'an Kla\v{c}an; Sizhong Zhang

arXiv:2605.17162·cs.AI·May 19, 2026

From Imitation to Interaction: Mastering Game of Schnapsen with Shallow Reinforcement Learning

J\'an Kla\v{c}an, Sizhong Zhang

PDF

TL;DR

This paper demonstrates that shallow neural network agents trained with reinforcement learning can effectively master the card game Schnapsen and outperform strong search-based baselines, especially when combined with deeper lookahead.

Contribution

It introduces a reinforcement learning approach for shallow neural networks to master Schnapsen, outperforming supervised imitation and strong search-based agents.

Findings

01

Reinforcement learning agents outperform supervised imitation agents.

02

Combining learned value functions with deeper lookahead improves performance.

03

Optimal training sample size varies depending on the setting.

Abstract

This paper investigates whether shallow neural network agents can master the card game Schnapsen and challenge a strong search-based baseline, RdeepBot, which uses Monte Carlo sampling and lookahead search. Guided by a progressively more complex experimental design, we first evaluate a supervised learning agent (MLPBot) trained on replay data and then a reinforcement learning agent (RLBot) with the same shallow architecture trained through asynchronous Monte Carlo updates and experience replay. The results show that supervised imitation does not generalize well enough to defeat strong RdeepBot opponents, whereas reinforcement learning produces substantially stronger agents. In the setting that focuses on the depth parameter of RdeepBot, the best performance is achieved when the learned value function is combined with deeper lookahead during gameplay, allowing RLBot to achieve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.