Accelerating Model-Based Reinforcement Learning using Non-Linear Trajectory Optimization

Marco Cal\`i; Giulio Giacomuzzo; Ruggero Carli; Alberto Dalla Libera

arXiv:2506.02767·cs.LG·June 4, 2025

Accelerating Model-Based Reinforcement Learning using Non-Linear Trajectory Optimization

Marco Cal\`i, Giulio Giacomuzzo, Ruggero Carli, Alberto Dalla Libera

PDF

Open Access

TL;DR

This paper introduces EB-MC-PILCO, a method that combines MC-PILCO with iLQR to accelerate policy optimization in model-based reinforcement learning, demonstrated on the cart-pole task.

Contribution

It integrates iLQR into MC-PILCO to improve convergence speed and exploration efficiency in nonlinear control tasks.

Findings

01

Up to 45.9% reduction in execution time.

02

Maintains 100% success rate across trials.

03

Faster convergence even with fewer iterations.

Abstract

This paper addresses the slow policy optimization convergence of Monte Carlo Probabilistic Inference for Learning Control (MC-PILCO), a state-of-the-art model-based reinforcement learning (MBRL) algorithm, by integrating it with iterative Linear Quadratic Regulator (iLQR), a fast trajectory optimization method suitable for nonlinear systems. The proposed method, Exploration-Boosted MC-PILCO (EB-MC-PILCO), leverages iLQR to generate informative, exploratory trajectories and initialize the policy, significantly reducing the number of required optimization steps. Experiments on the cart-pole task demonstrate that EB-MC-PILCO accelerates convergence compared to standard MC-PILCO, achieving up to $45.9%$ reduction in execution time when both methods solve the task in four trials. EB-MC-PILCO also maintains a $100%$ success rate across trials while solving the task faster, even in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Autonomous Vehicle Technology and Safety · Reinforcement Learning in Robotics