Beyond Short-Horizon: VQ-Memory for Robust Long-Horizon Manipulation in Non-Markovian Simulation Benchmarks

Honghui Wang; Zhi Jing; Jicong Ao; Shiji Song; Xuelong Li; Gao Huang; Chenjia Bai

arXiv:2603.09513·cs.RO·March 19, 2026

Beyond Short-Horizon: VQ-Memory for Robust Long-Horizon Manipulation in Non-Markovian Simulation Benchmarks

Honghui Wang, Zhi Jing, Jicong Ao, Shiji Song, Xuelong Li, Gao Huang, Chenjia Bai

PDF

Open Access

TL;DR

This paper introduces RuleSafe, a new non-Markovian manipulation benchmark with diverse locking tasks, and VQ-Memory, a structured temporal encoding method that improves long-horizon planning and generalization in robotic simulation.

Contribution

The paper presents VQ-Memory, a novel vector-quantized temporal representation, and RuleSafe, a complex manipulation benchmark, advancing long-horizon robotic manipulation in non-Markovian settings.

Findings

01

VQ-Memory improves long-horizon planning accuracy.

02

Enhances generalization to unseen configurations.

03

Reduces computational cost in manipulation tasks.

Abstract

The high cost of collecting real-robot data has made robotic simulation a scalable platform for both evaluation and data generation. Yet most existing benchmarks concentrate on simple manipulation tasks such as pick-and-place, failing to capture the non-Markovian characteristics of real-world tasks and the complexity of articulated object interactions. To address this limitation, we present RuleSafe, a new articulated manipulation benchmark built upon a scalable LLM-aided simulation framework. RuleSafe features safes with diverse unlocking mechanisms, such as key locks, password locks, and logic locks, which require different multi-stage reasoning and manipulation strategies. These LLM-generated rules produce non-Markovian and long-horizon tasks that require temporal modeling and memory-based reasoning. We further propose VQ-Memory, a compact and structured temporal representation that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Autonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics