Using Approximate Models in Robot Learning

Ali Lenjani

arXiv:1902.04696·cs.RO·February 14, 2019

Using Approximate Models in Robot Learning

Ali Lenjani

PDF

Open Access

TL;DR

This paper improves an approximate model-based reinforcement learning algorithm for robot trajectory control, enhancing convergence guarantees and validating its efficiency through simulations and comparisons with human experts.

Contribution

The paper introduces modifications to an existing approximate model algorithm, providing stronger convergence guarantees and demonstrating improved performance in simulated robot control tasks.

Findings

01

Enhanced convergence properties of the algorithm.

02

Successful implementation and validation in 2D robot simulation.

03

Performance comparison showing competitiveness with human experts.

Abstract

Trajectory following is one of the complicated control problems when its dynamics are nonlinear, stochastic and include a large number of parameters. The problem has significant difficulties including a large number of trials required for data collection and a massive volume of computations required to find a closed-loop controller for high dimensional and stochastic domains. For solving this type of problem, if we have an appropriate reward function and dynamics model; finding an optimal control policy is possible by using model-based reinforcement learning and optimal control algorithms. However, defining an accurate dynamics model is not possible for complicated problems. Pieter Abbeel and Andrew Ng recently presented an algorithm that requires only an approximate model and only a small number of real-life trials. This algorithm has broad applicability; however, there are some…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Path Planning Algorithms