A Reinforcement Learning Approach to the View Planning Problem

Mustafa Devrim Kaba; Mustafa Gokhan Uzunbas; Ser Nam Lim

arXiv:1610.06204·cs.CV·November 21, 2016

A Reinforcement Learning Approach to the View Planning Problem

Mustafa Devrim Kaba, Mustafa Gokhan Uzunbas, Ser Nam Lim

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning method for the view planning problem, aiming to generate minimal view sequences for complete 3D object sensing, outperforming traditional greedy algorithms.

Contribution

The paper proposes a novel RL-based approach with a geometry-exploiting score function and models VPP as an MDP, improving over greedy algorithms.

Findings

01

RL method outperforms greedy algorithm in most cases

02

Uses SARSA, Watkins-Q, and TD algorithms with function approximation

03

Effective in generating minimal view sequences for 3D models

Abstract

We present a Reinforcement Learning (RL) solution to the view planning problem (VPP), which generates a sequence of view points that are capable of sensing all accessible area of a given object represented as a 3D model. In doing so, the goal is to minimize the number of view points, making the VPP a class of set covering optimization problem (SCOP). The SCOP is NP-hard, and the inapproximability results tell us that the greedy algorithm provides the best approximation that runs in polynomial time. In order to find a solution that is better than the greedy algorithm, (i) we introduce a novel score function by exploiting the geometry of the 3D model, (ii) we model an intuitive human approach to VPP using this score function, and (iii) we cast VPP as a Markovian Decision Process (MDP), and solve the MDP in RL framework using well-known RL algorithms. In particular, we use SARSA, Watkins-Q…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Robotics and Sensor-Based Localization · Optimization and Search Problems