MetaWorld: Skill Transfer and Composition in a Hierarchical World Model for Grounding High-Level Instructions

Yutong Shen; Hangxu Liu; Kailin Pei; Ruizhe Xia; Tongtong Feng

arXiv:2601.17507·cs.RO·January 27, 2026

MetaWorld: Skill Transfer and Composition in a Hierarchical World Model for Grounding High-Level Instructions

Yutong Shen, Hangxu Liu, Kailin Pei, Ruizhe Xia, Tongtong Feng

PDF

Open Access

TL;DR

MetaWorld introduces a hierarchical world model that enhances humanoid robot skill transfer and composition by integrating semantic planning with physical control, improving efficiency and generalization in complex tasks.

Contribution

The paper presents MetaWorld, a novel hierarchical framework that combines semantic and physical models, enabling efficient skill transfer and better task generalization for humanoid robots.

Findings

01

Outperforms existing world model-based RL in task completion

02

Achieves higher motion coherence in humanoid tasks

03

Enables efficient online adaptation through expert policy transfer

Abstract

Humanoid robot loco-manipulation remains constrained by the semantic-physical gap. Current methods face three limitations: Low sample efficiency in reinforcement learning, poor generalization in imitation learning, and physical inconsistency in VLMs. We propose MetaWorld, a hierarchical world model that integrates semantic planning and physical control via expert policy transfer. The framework decouples tasks into a VLM-driven semantic layer and a latent dynamics model operating in a compact state space. Our dynamic expert selection and motion prior fusion mechanism leverages a pre-trained multi-expert policy library as transferable knowledge, enabling efficient online adaptation via a two-stage framework. VLMs serve as semantic interfaces, mapping instructions to executable skills and bypassing symbol grounding. Experiments on Humanoid-Bench show MetaWorld outperforms world model-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Social Robot Interaction and HRI