Metric State Space Reinforcement Learning for a Vision-Capable Mobile   Robot

Viktor Zhumatiy; Faustino Gomez; Marcus Hutter; Juergen; Schmidhuber

arXiv:cs/0603023·cs.RO·May 23, 2007·3 cites

Metric State Space Reinforcement Learning for a Vision-Capable Mobile Robot

Viktor Zhumatiy, Faustino Gomez, Marcus Hutter, Juergen, Schmidhuber

PDF

Open Access

TL;DR

This paper introduces a novel metric-based reinforcement learning algorithm for vision-capable mobile robots that learns directly from real-world interaction without manual discretization or models, enabling efficient autonomous control.

Contribution

It extends McCallum's Nearest-Sequence Memory algorithm to general metrics over trajectories, allowing learning in continuous, partially observable environments without manual discretization.

Findings

01

Successfully applied on a real mobile robot

02

Learns efficiently with less experience

03

Handles partial observability and continuous perceptual spaces

Abstract

We address the problem of autonomously learning controllers for vision-capable mobile robots. We extend McCallum's (1995) Nearest-Sequence Memory algorithm to allow for general metrics over state-action trajectories. We demonstrate the feasibility of our approach by successfully running our algorithm on a real mobile robot. The algorithm is novel and unique in that it (a) explores the environment and learns directly on a mobile robot without using a hand-made computer model as an intermediate step, (b) does not require manual discretization of the sensor input space, (c) works in piecewise continuous perceptual spaces, and (d) copes with partial observability. Together this allows learning from much less experience compared to previous methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Advanced Control Systems Optimization