Learning World Models for Unconstrained Goal Navigation
Yuanlin Duan, Wensen Mao, He Zhu

TL;DR
This paper introduces MUN, a goal-directed exploration algorithm that enhances world models in reinforcement learning, enabling agents to better generalize navigation policies across diverse goals and states.
Contribution
The paper presents MUN, a novel algorithm that improves world model generalization for goal navigation by modeling transitions between arbitrary subgoal states.
Findings
MUN significantly improves navigation success across various goals.
Enhanced world model reliability leads to better policy generalization.
Experimental results show superior performance over existing methods.
Abstract
Learning world models offers a promising avenue for goal-conditioned reinforcement learning with sparse rewards. By allowing agents to plan actions or exploratory goals without direct interaction with the environment, world models enhance exploration efficiency. The quality of a world model hinges on the richness of data stored in the agent's replay buffer, with expectations of reasonable generalization across the state space surrounding recorded trajectories. However, challenges arise in generalizing learned world models to state transitions backward along recorded trajectories or between states across different trajectories, hindering their ability to accurately model real-world dynamics. To address these challenges, we introduce a novel goal-directed exploration algorithm, MUN (short for "World Models for Unconstrained Goal Navigation"). This algorithm is capable of modeling state…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsFormal Methods in Verification · Fuzzy Logic and Control Systems · AI-based Problem Solving and Planning
