Using Approximate Models in Robot Learning
Ali Lenjani

TL;DR
This paper improves an approximate model-based reinforcement learning algorithm for robot trajectory control, enhancing convergence guarantees and validating its efficiency through simulations and comparisons with human experts.
Contribution
The paper introduces modifications to an existing approximate model algorithm, providing stronger convergence guarantees and demonstrating improved performance in simulated robot control tasks.
Findings
Enhanced convergence properties of the algorithm.
Successful implementation and validation in 2D robot simulation.
Performance comparison showing competitiveness with human experts.
Abstract
Trajectory following is one of the complicated control problems when its dynamics are nonlinear, stochastic and include a large number of parameters. The problem has significant difficulties including a large number of trials required for data collection and a massive volume of computations required to find a closed-loop controller for high dimensional and stochastic domains. For solving this type of problem, if we have an appropriate reward function and dynamics model; finding an optimal control policy is possible by using model-based reinforcement learning and optimal control algorithms. However, defining an accurate dynamics model is not possible for complicated problems. Pieter Abbeel and Andrew Ng recently presented an algorithm that requires only an approximate model and only a small number of real-life trials. This algorithm has broad applicability; however, there are some…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Path Planning Algorithms
