Training Cross-Morphology Embodied AI Agents: From Practical Challenges to Theoretical Foundations
Shaoshan Liu, Fan Wang, Hongjun Zhou, Yuanfeng Wang

TL;DR
This paper introduces the HEAT problem for training diverse embodied AI agents, analyzes its computational complexity, and proposes a distributed learning approach inspired by biological systems to improve scalability and robustness.
Contribution
It formalizes the HEAT problem, proves its computational hardness, and explores collective adaptation as a scalable solution for cross-morphology embodied AI training.
Findings
HEAT reduces to a PSPACE-complete POMDP, explaining training difficulties.
Distributed collective adaptation is NEXP-complete but practically scalable.
Theoretical insights guide the design of more robust embodied AI systems.
Abstract
While theory and practice are often seen as separate domains, this article shows that theoretical insight is essential for overcoming real-world engineering barriers. We begin with a practical challenge: training a cross-morphology embodied AI policy that generalizes across diverse robot morphologies. We formalize this as the Heterogeneous Embodied Agent Training (HEAT) problem and prove it reduces to a structured Partially Observable Markov Decision Process (POMDP) that is PSPACE-complete. This result explains why current reinforcement learning pipelines break down under morphological diversity, due to sequential training constraints, memory-policy coupling, and data incompatibility. We further explore Collective Adaptation, a distributed learning alternative inspired by biological systems. Though NEXP-complete in theory, it offers meaningful scalability and deployment benefits in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Embodied and Extended Cognition
