Curriculum Reinforcement Learning via Morphology-Environment Co-Evolution
Shuang Ao, Tianyi Zhou, Guodong Long, Xuan Song, Jing Jiang

TL;DR
This paper introduces a co-evolution approach for reinforcement learning where both agent morphology and environment are optimized simultaneously, creating an automatic curriculum that enhances generalization to new, unseen environments.
Contribution
The paper proposes a novel morphology-environment co-evolution framework with automatic policy-driven updates, improving RL generalization without manual curriculum design.
Findings
Co-evolution significantly improves generalization in unseen environments.
Automatic environment and morphology updates outperform static or manually designed curricula.
The approach adapts morphology and environment dynamically based on learning progress.
Abstract
Throughout long history, natural species have learned to survive by evolving their physical structures adaptive to the environment changes. In contrast, current reinforcement learning (RL) studies mainly focus on training an agent with a fixed morphology (e.g., skeletal structure and joint attributes) in a fixed environment, which can hardly generalize to changing environments or new tasks. In this paper, we optimize an RL agent and its morphology through ``morphology-environment co-evolution (MECE)'', in which the morphology keeps being updated to adapt to the changing environment, while the environment is modified progressively to bring new challenges and stimulate the improvement of the morphology. This leads to a curriculum to train generalizable RL, whose morphology and policy are optimized for different environments. Instead of hand-crafting the curriculum, we train two policies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
MethodsFocus
