Hierarchical Policy Learning for Mechanical Search
Oussama Zenkri, Ngo Anh Vien, Gerhard Neumann

TL;DR
This paper introduces a hierarchical reinforcement learning approach for Mechanical Search, significantly improving object retrieval success rates and reducing decision-making time by formulating the problem as a hierarchical POMDP.
Contribution
It formulates Mechanical Search as a hierarchical POMDP and develops sub-policies that enhance success rates and efficiency over rule-based methods.
Findings
Success rate increased from less than 32% to nearly 80%.
Push action computation time reduced from seconds to milliseconds.
Hierarchical policies outperform rule-based approaches.
Abstract
Retrieving objects from clutters is a complex task, which requires multiple interactions with the environment until the target object can be extracted. These interactions involve executing action primitives like grasping or pushing as well as setting priorities for the objects to manipulate and the actions to execute. Mechanical Search (MS) is a framework for object retrieval, which uses a heuristic algorithm for pushing and rule-based algorithms for high-level planning. While rule-based policies profit from human intuition in how they work, they usually perform sub-optimally in many cases. Deep reinforcement learning (RL) has shown great performance in complex tasks such as taking decisions through evaluating pixels, which makes it suitable for training policies in the context of object-retrieval. In this work, we first formulate the MS problem in a principled formulation as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Robotic Path Planning Algorithms
