Learning to Look Around: Intelligently Exploring Unseen Environments for   Unknown Tasks

Dinesh Jayaraman; Kristen Grauman

arXiv:1709.00507·cs.CV·December 22, 2017·1 cites

Learning to Look Around: Intelligently Exploring Unseen Environments for Unknown Tasks

Dinesh Jayaraman, Kristen Grauman

PDF

Open Access 2 Repos

TL;DR

This paper introduces a reinforcement learning approach for visual agents to autonomously explore and observe unseen environments, learning efficient behaviors for scene and shape completion without task-specific training.

Contribution

It presents a novel RL-based method enabling agents to learn general exploration policies that transfer across tasks and environments without manual labeling or recognition-specific training.

Findings

01

Policies effectively reduce uncertainty in unseen environments

02

Approach generalizes to new tasks and environments

03

No manual labeling required for training

Abstract

It is common to implicitly assume access to intelligently captured inputs (e.g., photos from a human photographer), yet autonomously capturing good observations is itself a major challenge. We address the problem of learning to look around: if a visual agent has the ability to voluntarily acquire new views to observe its environment, how can it learn efficient exploratory behaviors to acquire informative observations? We propose a reinforcement learning solution, where the agent is rewarded for actions that reduce its uncertainty about the unobserved portions of its environment. Based on this principle, we develop a recurrent neural network-based approach to perform active completion of panoramic natural scenes and 3D object shapes. Crucially, the learned policies are not tied to any recognition task nor to the particular semantic content seen during training. As a result, 1) the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Human Pose and Action Recognition