Curriculum Reinforcement Learning via Morphology-Environment   Co-Evolution

Shuang Ao; Tianyi Zhou; Guodong Long; Xuan Song; Jing Jiang

arXiv:2309.12529·cs.AI·September 25, 2023·1 cites

Curriculum Reinforcement Learning via Morphology-Environment Co-Evolution

Shuang Ao, Tianyi Zhou, Guodong Long, Xuan Song, Jing Jiang

PDF

Open Access

TL;DR

This paper introduces a co-evolution approach for reinforcement learning where both agent morphology and environment are optimized simultaneously, creating an automatic curriculum that enhances generalization to new, unseen environments.

Contribution

The paper proposes a novel morphology-environment co-evolution framework with automatic policy-driven updates, improving RL generalization without manual curriculum design.

Findings

01

Co-evolution significantly improves generalization in unseen environments.

02

Automatic environment and morphology updates outperform static or manually designed curricula.

03

The approach adapts morphology and environment dynamically based on learning progress.

Abstract

Throughout long history, natural species have learned to survive by evolving their physical structures adaptive to the environment changes. In contrast, current reinforcement learning (RL) studies mainly focus on training an agent with a fixed morphology (e.g., skeletal structure and joint attributes) in a fixed environment, which can hardly generalize to changing environments or new tasks. In this paper, we optimize an RL agent and its morphology through ``morphology-environment co-evolution (MECE)'', in which the morphology keeps being updated to adapt to the changing environment, while the environment is modified progressively to bring new challenges and stimulate the improvement of the morphology. This leads to a curriculum to train generalizable RL, whose morphology and policy are optimized for different environments. Instead of hand-crafting the curriculum, we train two policies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsFocus