Beyond Short-Horizon: VQ-Memory for Robust Long-Horizon Manipulation in Non-Markovian Simulation Benchmarks
Honghui Wang, Zhi Jing, Jicong Ao, Shiji Song, Xuelong Li, Gao Huang, Chenjia Bai

TL;DR
This paper introduces RuleSafe, a new non-Markovian manipulation benchmark with diverse locking tasks, and VQ-Memory, a structured temporal encoding method that improves long-horizon planning and generalization in robotic simulation.
Contribution
The paper presents VQ-Memory, a novel vector-quantized temporal representation, and RuleSafe, a complex manipulation benchmark, advancing long-horizon robotic manipulation in non-Markovian settings.
Findings
VQ-Memory improves long-horizon planning accuracy.
Enhances generalization to unseen configurations.
Reduces computational cost in manipulation tasks.
Abstract
The high cost of collecting real-robot data has made robotic simulation a scalable platform for both evaluation and data generation. Yet most existing benchmarks concentrate on simple manipulation tasks such as pick-and-place, failing to capture the non-Markovian characteristics of real-world tasks and the complexity of articulated object interactions. To address this limitation, we present RuleSafe, a new articulated manipulation benchmark built upon a scalable LLM-aided simulation framework. RuleSafe features safes with diverse unlocking mechanisms, such as key locks, password locks, and logic locks, which require different multi-stage reasoning and manipulation strategies. These LLM-generated rules produce non-Markovian and long-horizon tasks that require temporal modeling and memory-based reasoning. We further propose VQ-Memory, a compact and structured temporal representation that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Autonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics
