TL;DR
This paper explores local forward models for predicting state transitions in unforgiving games like Sokoban, demonstrating their effectiveness and analyzing how neighborhood patterns affect learning performance.
Contribution
It introduces formal definitions and two basic learning approaches for local forward models, showing their ability to generalize in complex, unforgiving game environments.
Findings
Hash Set and Decision Tree models quickly learn to predict state transitions.
Models perform well even on unseen test levels.
Too many attributes in local neighborhoods degrade model performance.
Abstract
This paper examines learning approaches for forward models based on local cell transition functions. We provide a formal definition of local forward models for which we propose two basic learning approaches. Our analysis is based on the game Sokoban, where a wrong action can lead to an unsolvable game state. Therefore, an accurate prediction of an action's resulting state is necessary to avoid this scenario. In contrast to learning the complete state transition function, local forward models allow extracting multiple training examples from a single state transition. In this way, the Hash Set model, as well as the Decision Tree model, quickly learn to predict upcoming state transitions of both the training and the test set. Applying the model using a statistical forward planner showed that the best models can be used to satisfying degree even in cases in which the test levels have not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
