CORL: A Continuous-state Offset-dynamics Reinforcement Learner
Emma Brunskill, Bethany Leffler, Lihong Li, Michael L. Littman,, Nicholas Roy

TL;DR
This paper introduces CORL, a reinforcement learning algorithm designed for continuous state spaces with stochastic, switching dynamics, demonstrating its effectiveness through theoretical guarantees and a robotic car experiment.
Contribution
The paper presents a novel RL algorithm for continuous, stochastic environments with polynomial sample complexity and integrates fitted value iteration for planning in learned models.
Findings
Algorithm is probably approximately correct with polynomial sample complexity.
Effectively models real-world dynamics in robotic terrain navigation.
Successfully applied to a robotic car experiment.
Abstract
Continuous state spaces and stochastic, switching dynamics characterize a number of rich, realworld domains, such as robot navigation across varying terrain. We describe a reinforcementlearning algorithm for learning in these domains and prove for certain environments the algorithm is probably approximately correct with a sample complexity that scales polynomially with the state-space dimension. Unfortunately, no optimal planning techniques exist in general for such problems; instead we use fitted value iteration to solve the learned MDP, and include the error due to approximate planning in our bounds. Finally, we report an experiment using a robotic car driving over varying terrain to demonstrate that these dynamics representations adequately capture real-world dynamics and that our algorithm can be used to efficiently solve such problems.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Evolutionary Algorithms and Applications
