Inverse reinforcement learning in continuous time and space

Rushikesh Kamalapurkar

arXiv:1801.07663·cs.SY·July 7, 2021

TL;DR

This paper introduces a data-driven inverse reinforcement learning method for continuous-time, continuous-space linear systems, enabling online estimation of an agent's cost function from input-output data.

Contribution

It develops a novel output-feedback inverse reinforcement learning approach using simultaneous state and parameter estimation for linear systems.

Findings

01 Estimates an agent's cost function from input-output measurements

02 Runs online, in continuous time and continuous space

03 Recovers the cost function only up to a multiplicative constant

Abstract

This paper develops a data-driven inverse reinforcement learning technique for a class of linear systems to estimate the cost function of an agent online, using input-output measurements. A simultaneous state and parameter estimator is utilized to facilitate output-feedback inverse reinforcement learning, and cost function estimation is achieved up to multiplication by a constant.
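
The "up to multiplication by a constant" qualification can be illustrated with a minimal sketch that is not taken from the paper. It assumes a scalar linear system dx/dt = a x + b u and a quadratic cost, the integral of q x^2 + r u^2; the quadratic form, the function name scalar_lqr_gain, and all numeric values are assumptions made for illustration. Scaling q and r by the same positive constant leaves the optimal feedback gain unchanged, so the two costs generate identical behavior and cannot be distinguished from input-output data.

```python
import math


def scalar_lqr_gain(a, b, q, r):
    """Optimal gain k (control law u = -k x) for dx/dt = a x + b u.

    Assumed cost (illustrative, not from the paper): integral of
    q x^2 + r u^2 with q > 0, r > 0, and b != 0.
    """
    # Positive root of the scalar Riccati equation
    #   2 a p - (b^2 / r) p^2 + q = 0
    p = r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)
    # Optimal gain k = b p / r
    return b * p / r


# Hypothetical system and cost parameters.
a, b = -1.0, 2.0
q, r = 3.0, 0.5

k_original = scalar_lqr_gain(a, b, q, r)
# Multiply the whole cost by 10: the gain depends only on the ratio q / r.
k_scaled = scalar_lqr_gain(a, b, 10.0 * q, 10.0 * r)

print(k_original, k_scaled)
```

Because the gain depends on q and r only through their ratio, any estimator that observes the agent's behavior can at best recover that ratio, which is consistent with the scale ambiguity stated in the abstract.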
