Resolving State Ambiguity in Robot Manipulation via Adaptive Working Memory Recoding

Qingda Hu; Ziheng Qiu; Zijun Xu; Kaizhao Zhang; Xizhou Bu; Zuolei Sun; Bo Zhang; Jieru Zhao; Zhongxue Gan; Wenchao Ding

arXiv:2512.24638·cs.RO·January 1, 2026

Resolving State Ambiguity in Robot Manipulation via Adaptive Working Memory Recoding

Qingda Hu, Ziheng Qiu, Zijun Xu, Kaizhao Zhang, Xizhou Bu, Zuolei Sun, Bo Zhang, Jieru Zhao, Zhongxue Gan, Wenchao Ding

PDF

Open Access

TL;DR

This paper introduces PAM, an adaptive working memory policy for robotic manipulation that effectively resolves state ambiguity by utilizing a hierarchical memory system, enabling robust performance with a large history window and high inference speed.

Contribution

The paper proposes PAM, a novel visuomotor policy with adaptive working memory, supporting large history windows and efficient ambiguity resolution in robotic tasks.

Findings

01

Supports a 300-frame history window with high inference speed

02

Handles multiple state ambiguity scenarios simultaneously

03

Maintains stable training with a 10-second history window

Abstract

State ambiguity is common in robotic manipulation. Identical observations may correspond to multiple valid behavior trajectories. The visuomotor policy must correctly extract the appropriate types and levels of information from the history to identify the current task phase. However, naively extending the history window is computationally expensive and may cause severe overfitting. Inspired by the continuous nature of human reasoning and the recoding of working memory, we introduce PAM, a novel visuomotor Policy equipped with Adaptive working Memory. With minimal additional training cost in a two-stage manner, PAM supports a 300-frame history window while maintaining high inference speed. Specifically, a hierarchical frame feature extractor yields two distinct representations for motion primitives and temporal disambiguation. For compact representation, a context router with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Human Pose and Action Recognition