Reinforcement learning with world model
Jingbin Liu, Xinyang Gu, Shuai Liu

TL;DR
This paper introduces a reinforcement learning framework combining a world model with Soft Actor-Critic to enhance sample efficiency, memory, and prediction, achieving state-of-the-art results and adaptability to POMDPs.
Contribution
The paper proposes a novel agent framework integrating world model learning with off-policy RL, improving efficiency, stability, and extending to POMDPs.
Findings
Achieved new state-of-the-art results on benchmark tasks.
Maintained high sample efficiency and training stability.
Extended framework to partially observable environments without performance loss.
Abstract
Nowadays, model-free reinforcement learning algorithms have achieved remarkable performance on many decision making and control tasks, but high sample complexity and low sample efficiency still hinder the wide use of model-free reinforcement learning algorithms. In this paper, we argue that if we intend to design an intelligent agent that learns fast and transfers well, the agent must be able to reflect key elements of intelligence, like intuition, Memory, PredictionandCuriosity. We propose an agent framework that integrates off-policy reinforcement learning with world model learning, so as to embody the important features of intelligence in our algorithm design. We adopt the state-of-art model-free reinforcement learning algorithm, Soft Actor-Critic, as the agent intuition, and world model learning through RNN to endow the agent with memory, curiosity, and the ability to predict. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
