ContextFlow: Hierarchical Task-State Alignment for Long-Horizon Embodied Agents

Shuhan Guo; Kun Zhang; Haifei Liu; Xingyu Gao; Yongqi Zhang; Yaqing Wang; Quanming Yao

arXiv:2605.19314·cs.RO·May 20, 2026

ContextFlow: Hierarchical Task-State Alignment for Long-Horizon Embodied Agents

Shuhan Guo, Kun Zhang, Haifei Liu, Xingyu Gao, Yongqi Zhang, Yaqing Wang, Quanming Yao

PDF

TL;DR

ContextFlow is a framework that maintains task-level consistency in long-horizon embodied agents by explicitly aligning planning, monitoring, and execution stages, improving robustness and transparency.

Contribution

It introduces an inspectable alignment framework that explicitly represents task stages and applies scoped updates to mitigate task-state failures.

Findings

01

Experiments show improved task success rates with ContextFlow.

02

The framework enables explicit diagnosis of task-state failures.

03

Demonstration traces illustrate effective mitigation of recurring issues.

Abstract

Long-horizon embodied agents increasingly delegate navigation, search, approach, and manipulation to specialist executors. As these executors become stronger, the main bottleneck shifts from local skill execution to maintaining a coherent task frontier across planning, monitoring, memory, and execution. We study task-state misalignment, a task-level consistency failure in which the planner's active stage, runtime evidence, remembered context, and delegated executor no longer justify the same next-step decision. This failure can lead to unsupported handoffs, stage lock, executor-context mismatch, and unnecessary replanning. We propose ContextFlow, an inspectable alignment framework that represents stages as explicit contracts, converts runtime observations into evidence packets, and applies scoped updates including continue, refine, transfer, promote, and repair. ContextFlow keeps…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.