Model-Ensemble Trust-Region Policy Optimization
Thanard Kurutach, Ignasi Clavera, Yan Duan, Aviv Tamar, and Pieter, Abbeel

TL;DR
This paper introduces ME-TRPO, a model-ensemble approach for reinforcement learning that reduces sample complexity and stabilizes training by managing model uncertainty, outperforming traditional model-free methods.
Contribution
The paper proposes a novel ensemble-based method, ME-TRPO, that improves stability and efficiency in deep model-based reinforcement learning.
Findings
Significantly reduces sample complexity on continuous control tasks.
Ensemble models effectively manage uncertainty and stabilize training.
Likelihood ratio derivatives outperform backpropagation through time in stability.
Abstract
Model-free reinforcement learning (RL) methods are succeeding in a growing number of tasks, aided by recent advances in deep learning. However, they tend to suffer from high sample complexity, which hinders their use in real-world domains. Alternatively, model-based reinforcement learning promises to reduce sample complexity, but tends to require careful tuning and to date have succeeded mainly in restrictive domains where simple models are sufficient for learning. In this paper, we analyze the behavior of vanilla model-based reinforcement learning methods when deep neural networks are used to learn both the model and the policy, and show that the learned policy tends to exploit regions where insufficient data is available for the model to be learned, causing instability in training. To overcome this issue, we propose to use an ensemble of models to maintain the model uncertainty and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Fuel Cells and Related Materials
