MAP: A Map-then-Act Paradigm for Long-Horizon Interactive Agent Reasoning

Yuxin Liu; Ziang Ye; Yueqing Sun; Mingye Zhu; Jinwei Xiao; Zhuowen Han; Qi GU; Xunliang Cai; Lei Zhang

arXiv:2605.13037·cs.AI·May 14, 2026

MAP: A Map-then-Act Paradigm for Long-Horizon Interactive Agent Reasoning

Yuxin Liu, Ziang Ye, Yueqing Sun, Mingye Zhu, Jinwei Xiao, Zhuowen Han, Qi GU, Xunliang Cai, Lei Zhang

PDF

TL;DR

The paper introduces MAP, a paradigm that shifts environment understanding before action execution in interactive agents, leading to improved performance across benchmarks.

Contribution

It proposes a novel Map-then-Act framework inspired by human cognition, with three stages: exploration, mapping, and knowledge-augmented execution.

Findings

01

MAP surpasses baseline performance in 22 of 25 game environments.

02

Training on MAP-2K dataset outperforms expert traces.

03

MAP achieves consistent gains across different benchmarks and LLMs.

Abstract

Current interactive LLM agents rely on goal-conditioned stepwise planning, where environmental understanding is acquired reactively during execution rather than established beforehand. This temporal inversion leads to Delayed Environmental Perception: agents must infer environmental constraints through trial-and-error, resulting in an Epistemic Bottleneck that traps them in inefficient failure cycles. Inspired by human affordance perception and cognitive map theory, we propose the Map-then-Act Paradigm (MAP), a plug-and-play framework that shifts environment understanding before execution. MAP consists of three stages: (1) Global Exploration, acquiring environment-general priors; (2) Task-Specific Mapping, constructing a structured cognitive map; and (3) Knowledge-Augmented Execution, solving tasks grounded on the map. Experiments show consistent gains across benchmarks and LLMs. On…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.