Visuomotor Mechanical Search: Learning to Retrieve Target Objects in   Clutter

Andrey Kurenkov; Joseph Taglic; Rohun Kulkarni; Marcus; Dominguez-Kuhne; Animesh Garg; Roberto Mart\'in-Mart\'in; Silvio Savarese

arXiv:2008.06073·cs.AI·August 17, 2020

Visuomotor Mechanical Search: Learning to Retrieve Target Objects in Clutter

Andrey Kurenkov, Joseph Taglic, Rohun Kulkarni, Marcus, Dominguez-Kuhne, Animesh Garg, Roberto Mart\'in-Mart\'in, Silvio Savarese

PDF

TL;DR

This paper introduces a novel deep reinforcement learning method for robotic object retrieval in cluttered environments, combining teacher guidance, privileged critic information, and mid-level representations to improve learning efficiency and uncovering success.

Contribution

The work presents a new RL approach that enhances sample efficiency and effectiveness in unoccluding objects, outperforming baselines and enabling better grasping in cluttered scenes.

Findings

01

Faster training convergence compared to baselines

02

Improved uncovering efficiency of occluded objects

03

Enhanced graspability of target objects after policy execution

Abstract

When searching for objects in cluttered environments, it is often necessary to perform complex interactions in order to move occluding objects out of the way and fully reveal the object of interest and make it graspable. Due to the complexity of the physics involved and the lack of accurate models of the clutter, planning and controlling precise predefined interactions with accurate outcome is extremely hard, when not impossible. In problems where accurate (forward) models are lacking, Deep Reinforcement Learning (RL) has shown to be a viable solution to map observations (e.g. images) to good interactions in the form of close-loop visuomotor policies. However, Deep RL is sample inefficient and fails when applied directly to the problem of unoccluding objects based on images. In this work we present a novel Deep RL procedure that combines i) teacher-aided exploration, ii) a critic with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.