Belief Tree Search for Active Object Recognition

Mohsen Malmir; Garrison W. Cottrell

arXiv:1708.03901·cs.AI·August 15, 2017

Belief Tree Search for Active Object Recognition

Mohsen Malmir, Garrison W. Cottrell

PDF

TL;DR

This paper introduces a belief tree search approach for active object recognition, modeling it as a POMDP, and demonstrates improved recognition accuracy and policy optimization through supervised learning and observation function refinement.

Contribution

The paper presents a novel POMDP-based framework for AOR, using belief tree search and LSTM prediction, with a gradient-based method to optimize the observation function.

Findings

01

LSTM-based policy outperforms guided policy search in recognition accuracy.

02

Optimizing the observation function improves total reward and recognition performance.

03

The approach generalizes well to new views and objects.

Abstract

Active Object Recognition (AOR) has been approached as an unsupervised learning problem, in which optimal trajectories for object inspection are not known and are to be discovered by reducing label uncertainty measures or training with reinforcement learning. Such approaches have no guarantees of the quality of their solution. In this paper, we treat AOR as a Partially Observable Markov Decision Process (POMDP) and find near-optimal policies on training data using Belief Tree Search (BTS) on the corresponding belief Markov Decision Process (MDP). AOR then reduces to the problem of knowledge transfer from near-optimal policies on training set to the test set. We train a Long Short Term Memory (LSTM) network to predict the best next action on the training set rollouts. We sho that the proposed AOR method generalizes well to novel views of familiar objects and also to novel objects. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory