How language models extrapolate outside the training data: A case study in Textualized Gridworld
Doyoung Kim, Jongwon Lee, Jinho Park, Minjoon Seo

TL;DR
This paper investigates how language models extrapolate in complex environments using a textualized Gridworld, revealing limitations of conventional methods and proposing cognitive maps inspired by human cognition to improve extrapolation capabilities.
Contribution
Introduces a novel cognitive map framework for path planning in language models, inspired by human mental representations, to enhance extrapolation in unseen environments.
Findings
Cognitive maps improve extrapolation to larger, unseen environments.
Conventional methods like Chain of Thought fail in larger, novel environments.
Cognitive maps exhibit humanlike mental simulation and rapid adaptation.
Abstract
Language models' ability to extrapolate learned behaviors to novel, more complex environments beyond their training scope is highly unknown. This study introduces a path planning task in a textualized Gridworld to probe language models' extrapolation capabilities. We show that conventional approaches, including next token prediction and Chain of Thought (CoT) finetuning, fail to extrapolate in larger, unseen environments. Inspired by human cognition and dual process theory, we propose cognitive maps for path planning, a novel CoT framework that simulates humanlike mental representations. Our experiments show that cognitive maps not only enhance extrapolation to unseen environments but also exhibit humanlike characteristics through structured mental simulation and rapid adaptation. Our finding that these cognitive maps require specialized training schemes and cannot be induced through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Cognitive Science and Mapping · Speech and dialogue systems
