Graph-based Cluttered Scene Generation and Interactive Exploration using Deep Reinforcement Learning

K. Niranjan Kumar; Irfan Essa; Sehoon Ha

arXiv:2109.10460 · cs.RO · September 23, 2021

TL;DR

This paper presents a deep reinforcement learning framework for generating and exploring cluttered scenes, enabling robots to discover hidden objects efficiently in structured environments like kitchens.

Contribution

It introduces a novel scene grammar and a GNN-based scene generation method, along with an exploration policy that generalizes to real-world cluttered scenes.

Findings

01. Agents outperform baselines in object discovery

02. Effective sim-to-real transfer demonstrated on a UR10 robot

03. Scene generation produces diverse, stable cluttered scenes

Abstract

We introduce a novel method to teach a robotic agent to interactively explore cluttered yet structured scenes, such as kitchen pantries and grocery shelves, by leveraging the physical plausibility of the scene. We propose a novel learning framework to train an effective scene exploration policy to discover hidden objects with minimal interactions. First, we define a novel scene grammar to represent structured clutter. Then we train a Graph Neural Network (GNN) based Scene Generation agent using deep reinforcement learning (deep RL), to manipulate this Scene Grammar to create a diverse set of stable scenes, each containing multiple hidden objects. Given such cluttered scenes, we then train a Scene Exploration agent, using deep RL, to uncover hidden objects by interactively rearranging the scene. We show that our learned agents hide and discover significantly more objects than the…
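
To make the exploration task in the abstract concrete, here is a minimal toy sketch in Python. It is not the paper's scene grammar, GNN, or learned policy: the `ToyScene` class, its `occludes` relation, and the `greedy_explore` heuristic are all hypothetical names introduced only for illustration. The sketch assumes a scene can be reduced to an acyclic "object A hides object B" relation, and it uses a hand-written greedy rule where the paper trains an agent with deep RL.

```python
from dataclasses import dataclass, field


@dataclass
class ToyScene:
    """Toy stand-in for a structured cluttered scene (hypothetical, not the paper's grammar)."""

    # occludes[a] is the set of objects that object a hides from the camera.
    occludes: dict = field(default_factory=dict)

    def add(self, name, hides=()):
        """Register an object and the objects it hides."""
        self.occludes.setdefault(name, set()).update(hides)
        for hidden_name in hides:
            self.occludes.setdefault(hidden_name, set())

    def hidden(self):
        """Objects that are hidden by at least one other object."""
        result = set()
        for hidden_set in self.occludes.values():
            result |= hidden_set
        return result

    def visible(self):
        """Objects that nothing else hides."""
        return set(self.occludes) - self.hidden()

    def remove(self, name):
        """Take an object out of the scene, revealing whatever only it was hiding."""
        self.occludes.pop(name)
        for hidden_set in self.occludes.values():
            hidden_set.discard(name)


def greedy_explore(scene):
    """Hand-written baseline: repeatedly remove the visible object hiding the most.

    Assumes the occlusion relation is acyclic, so some visible occluder
    always exists while anything is still hidden.
    Returns the list of removed objects; its length is the interaction count.
    """
    removed = []
    while scene.hidden():
        candidates = sorted(scene.visible())  # sorted for deterministic tie-breaking
        target = max(candidates, key=lambda n: len(scene.occludes[n]))
        scene.remove(target)
        removed.append(target)
    return removed
```

For example, on a shelf where a cereal box hides a can and a jar, and the can in turn hides a spice tin, the greedy rule removes the cereal box and then the can, revealing all three hidden objects in two interactions. A learned exploration policy would aim to keep that interaction count low without being handed the occlusion relation up front.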

Taxonomy

Topics: Multimodal Machine Learning Applications · Reinforcement Learning in Robotics · Human Pose and Action Recognition