Closed-Loop Learning of Visual Control Policies
S. R. Jodogne, J. H. Piater

TL;DR
This paper introduces a flexible, feature-based framework for learning visual control policies through interaction, combining classifiers, reinforcement learning, and hierarchical features to address perceptual aliasing and overfitting.
Contribution
It presents a novel approach integrating feature-based classifiers with reinforcement learning and hierarchical features for visual control policy learning.
Findings
Successfully applied to three visual navigation tasks
Effective in disambiguating aliased states
Addresses overfitting in greedy feature selection
Abstract
In this paper we present a general, flexible framework for learning mappings from images to actions by interacting with the environment. The basic idea is to introduce a feature-based image classifier in front of a reinforcement learning algorithm. The classifier partitions the visual space according to the presence or absence of few highly informative local descriptors that are incrementally selected in a sequence of attempts to remove perceptual aliasing. We also address the problem of fighting overfitting in such a greedy algorithm. Finally, we show how high-level visual features can be generated when the power of local descriptors is insufficient for completely disambiguating the aliased states. This is done by building a hierarchy of composite features that consist of recursive spatial combinations of visual features. We demonstrate the efficacy of our algorithms by solving three…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
