Closed-Loop Learning of Visual Control Policies

S. R. Jodogne; J. H. Piater

arXiv:1110.2210·cs.CV·October 12, 2011

Closed-Loop Learning of Visual Control Policies

S. R. Jodogne, J. H. Piater

PDF

TL;DR

This paper introduces a flexible, feature-based framework for learning visual control policies through interaction, combining classifiers, reinforcement learning, and hierarchical features to address perceptual aliasing and overfitting.

Contribution

It presents a novel approach integrating feature-based classifiers with reinforcement learning and hierarchical features for visual control policy learning.

Findings

01

Successfully applied to three visual navigation tasks

02

Effective in disambiguating aliased states

03

Addresses overfitting in greedy feature selection

Abstract

In this paper we present a general, flexible framework for learning mappings from images to actions by interacting with the environment. The basic idea is to introduce a feature-based image classifier in front of a reinforcement learning algorithm. The classifier partitions the visual space according to the presence or absence of few highly informative local descriptors that are incrementally selected in a sequence of attempts to remove perceptual aliasing. We also address the problem of fighting overfitting in such a greedy algorithm. Finally, we show how high-level visual features can be generated when the power of local descriptors is insufficient for completely disambiguating the aliased states. This is done by building a hierarchy of composite features that consist of recursive spatial combinations of visual features. We demonstrate the efficacy of our algorithms by solving three…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.