Learning Long-Horizon Robot Manipulation Skills via Privileged Action

Xiaofeng Mao; Yucheng Xu; Zhaole Sun; Elle Miller; Daniel Layeghi,; Michael Mistry

arXiv:2502.15442·cs.RO·February 24, 2025

Learning Long-Horizon Robot Manipulation Skills via Privileged Action

Xiaofeng Mao, Yucheng Xu, Zhaole Sun, Elle Miller, Daniel Layeghi,, Michael Mistry

PDF

TL;DR

This paper introduces a structured framework using privileged actions and curriculum learning in simulation to efficiently train robot manipulation skills for complex long-horizon tasks, with successful transfer to real-world scenarios.

Contribution

The paper presents a novel simulation-based training method leveraging privileged actions and curriculum learning, enabling efficient acquisition of complex manipulation skills without extensive reward engineering.

Findings

01

Successfully trained policies for multi-stage long-horizon tasks

02

Achieved robust and diverse behaviors across environments

03

Demonstrated transferability of skills to real-world robots

Abstract

Long-horizon contact-rich tasks are challenging to learn with reinforcement learning, due to ineffective exploration of high-dimensional state spaces with sparse rewards. The learning process often gets stuck in local optimum and demands task-specific reward fine-tuning for complex scenarios. In this work, we propose a structured framework that leverages privileged actions with curriculum learning, enabling the policy to efficiently acquire long-horizon skills without relying on extensive reward engineering or reference trajectories. Specifically, we use privileged actions in simulation with a general training procedure that would be infeasible to implement in real-world scenarios. These privileges include relaxed constraints and virtual forces that enhance interaction and exploration with objects. Our results successfully achieve complex multi-stage long-horizon tasks that naturally…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.