Learning World Models for Unconstrained Goal Navigation

Yuanlin Duan; Wensen Mao; He Zhu

arXiv:2411.02446·cs.LG·November 6, 2024

Learning World Models for Unconstrained Goal Navigation

Yuanlin Duan, Wensen Mao, He Zhu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces MUN, a goal-directed exploration algorithm that enhances world models in reinforcement learning, enabling agents to better generalize navigation policies across diverse goals and states.

Contribution

The paper presents MUN, a novel algorithm that improves world model generalization for goal navigation by modeling transitions between arbitrary subgoal states.

Findings

01

MUN significantly improves navigation success across various goals.

02

Enhanced world model reliability leads to better policy generalization.

03

Experimental results show superior performance over existing methods.

Abstract

Learning world models offers a promising avenue for goal-conditioned reinforcement learning with sparse rewards. By allowing agents to plan actions or exploratory goals without direct interaction with the environment, world models enhance exploration efficiency. The quality of a world model hinges on the richness of data stored in the agent's replay buffer, with expectations of reasonable generalization across the state space surrounding recorded trajectories. However, challenges arise in generalizing learned world models to state transitions backward along recorded trajectories or between states across different trajectories, hindering their ability to accurately model real-world dynamics. To address these challenges, we introduce a novel goal-directed exploration algorithm, MUN (short for "World Models for Unconstrained Goal Navigation"). This algorithm is capable of modeling state…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

RU-Automated-Reasoning-Group/MUN
tfOfficial

Videos

Learning World Models for Unconstrained Goal Navigation· slideslive

Taxonomy

TopicsFormal Methods in Verification · Fuzzy Logic and Control Systems · AI-based Problem Solving and Planning