Actor-Critic for Linearly-Solvable Continuous MDP with Partially Known   Dynamics

Tomoki Nishi; Prashant Doshi; Michael R. James; Danil; Prokhorov

arXiv:1706.01077·cs.AI·June 6, 2017·5 cites

Actor-Critic for Linearly-Solvable Continuous MDP with Partially Known Dynamics

Tomoki Nishi, Prashant Doshi, Michael R. James, Danil, Prokhorov

PDF

Open Access

TL;DR

This paper introduces an actor-critic reinforcement learning method for linearly-solvable continuous MDPs that learns with partial knowledge of system dynamics without active exploration, improving policy performance.

Contribution

It presents a novel actor-critic RL approach for L-MDPs that does not require modeling uncontrolled dynamics or transition noise, only control dynamics, suitable for real-world applications.

Findings

01

Improved learning efficiency demonstrated on synthetic problems

02

Effective policy performance in simulated traffic data

03

No need for active exploration or full system modeling

Abstract

In many robotic applications, some aspects of the system dynamics can be modeled accurately while others are difficult to obtain or model. We present a novel reinforcement learning (RL) method for continuous state and action spaces that learns with partial knowledge of the system and without active exploration. It solves linearly-solvable Markov decision processes (L-MDPs), which are well suited for continuous state and action spaces, based on an actor-critic architecture. Compared to previous RL methods for L-MDPs and path integral methods which are model based, the actor-critic learning does not need a model of the uncontrolled dynamics and, importantly, transition noise levels; however, it requires knowing the control dynamics for the problem. We evaluate our method on two synthetic test problems, and one real-world problem in simulation and using real traffic data. Our experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Optimization and Search Problems · Distributed Control Multi-Agent Systems