Push Smarter, Not Harder: Hierarchical RL-Diffusion Policy for Efficient Nonprehensile Manipulation

Steven Caro; Stephen L. Smith

arXiv:2512.10099·cs.RO·December 12, 2025

Push Smarter, Not Harder: Hierarchical RL-Diffusion Policy for Efficient Nonprehensile Manipulation

Steven Caro, Stephen L. Smith

PDF

Open Access

TL;DR

This paper introduces HeRD, a hierarchical RL-diffusion policy that improves nonprehensile pushing tasks by combining high-level goal selection with low-level trajectory generation, leading to better success and efficiency.

Contribution

The work presents a novel hierarchical framework integrating reinforcement learning and diffusion models for nonprehensile manipulation tasks.

Findings

01

Outperforms state-of-the-art in success rate and path efficiency

02

Demonstrates strong generalization across environments

03

Combines RL and diffusion models effectively

Abstract

Nonprehensile manipulation, such as pushing objects across cluttered environments, presents a challenging control problem due to complex contact dynamics and long-horizon planning requirements. In this work, we propose HeRD, a hierarchical reinforcement learning-diffusion policy that decomposes pushing tasks into two levels: high-level goal selection and low-level trajectory generation. We employ a high-level reinforcement learning (RL) agent to select intermediate spatial goals, and a low-level goal-conditioned diffusion model to generate feasible, efficient trajectories to reach them. This architecture combines the long-term reward maximizing behaviour of RL with the generative capabilities of diffusion models. We evaluate our method in a 2D simulation environment and show that it outperforms the state-of-the-art baseline in success rate, path efficiency, and generalization across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Robotic Path Planning Algorithms