MrCoM: A Meta-Regularized World-Model Generalizing Across Multi-Scenarios

Xuantang Xiong; Ni Mu; Runpeng Xie; Senhao Yang; Yaqing Wang; Lexiang Wang; Yao Luan; Siyuan Li; Shuang Xu; Yiqin Yang; Bo Xu

arXiv:2511.06252·cs.LG·November 11, 2025

MrCoM: A Meta-Regularized World-Model Generalizing Across Multi-Scenarios

Xuantang Xiong, Ni Mu, Runpeng Xie, Senhao Yang, Yaqing Wang, Lexiang Wang, Yao Luan, Siyuan Li, Shuang Xu, Yiqin Yang, Bo Xu

PDF

Open Access

TL;DR

MrCoM introduces a meta-regularized world model that generalizes across multiple scenarios in model-based reinforcement learning, improving prediction accuracy and policy transfer by decomposing state space and regularizing representations.

Contribution

The paper proposes MrCoM, a novel meta-regularized world model that enhances multi-scenario generalization in MBRL through state decomposition and regularization techniques.

Findings

01

Outperforms previous methods in diverse scenarios

02

Theoretically bounds generalization error in multi-scenario settings

03

Demonstrates improved sample efficiency and prediction accuracy

Abstract

Model-based reinforcement learning (MBRL) is a crucial approach to enhance the generalization capabilities and improve the sample efficiency of RL algorithms. However, current MBRL methods focus primarily on building world models for single tasks and rarely address generalization across different scenarios. Building on the insight that dynamics within the same simulation engine share inherent properties, we attempt to construct a unified world model capable of generalizing across different scenarios, named Meta-Regularized Contextual World-Model (MrCoM). This method first decomposes the latent state space into various components based on the dynamic characteristics, thereby enhancing the accuracy of world-model prediction. Further, MrCoM adopts meta-state regularization to extract unified representation of scenario-relevant information, and meta-value regularization to align world-model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Model Reduction and Neural Networks · Domain Adaptation and Few-Shot Learning