Combining Benefits from Trajectory Optimization and Deep Reinforcement   Learning

Guillaume Bellegarda; Katie Byl

arXiv:1910.09667·cs.RO·October 23, 2019·5 cites

Combining Benefits from Trajectory Optimization and Deep Reinforcement Learning

Guillaume Bellegarda, Katie Byl

PDF

Open Access

TL;DR

This paper proposes a hybrid approach combining trajectory optimization and deep reinforcement learning to enhance sample efficiency and provide safety guarantees for robotic control, demonstrated on a car model.

Contribution

It introduces a method that integrates trajectory optimization with reinforcement learning to reduce sample complexity and estimate worst-case performance bounds.

Findings

01

Reduced RL sample complexity using optimal control knowledge

02

Provided upper bound estimates for time-to-arrival during training

03

Demonstrated applicability to mobile robotic systems

Abstract

Recent breakthroughs both in reinforcement learning and trajectory optimization have made significant advances towards real world robotic system deployment. Reinforcement learning (RL) can be applied to many problems without needing any modeling or intuition about the system, at the cost of high sample complexity and the inability to prove any metrics about the learned policies. Trajectory optimization (TO) on the other hand allows for stability and robustness analyses on generated motions and trajectories, but is only as good as the often over-simplified derived model, and may have prohibitively expensive computation times for real-time control. This paper seeks to combine the benefits from these two areas while mitigating their drawbacks by (1) decreasing RL sample complexity by using existing knowledge of the problem with optimal control, and (2) providing an upper bound estimate on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Robotic Path Planning Algorithms