Constrained Meta Agnostic Reinforcement Learning

Karam Daaboul; Florian Kuhm; Tim Joseph; J. Marius Zoellner

arXiv:2406.14047·cs.LG·June 21, 2024

Constrained Meta Agnostic Reinforcement Learning

Karam Daaboul, Florian Kuhm, Tim Joseph, J. Marius Zoellner

PDF

Open Access

TL;DR

This paper introduces C-MAML, a meta-learning algorithm that integrates constrained optimization to enable rapid adaptation to new tasks while respecting environmental constraints, demonstrated on robotic locomotion tasks.

Contribution

The paper presents C-MAML, a novel meta-learning framework that incorporates task-specific constraints during training to improve safety and adaptability in real-world environments.

Findings

01

C-MAML achieves faster adaptation to new tasks with constraints.

02

It produces safer initial policies for task learning.

03

Demonstrated effectiveness on simulated robotic locomotion tasks.

Abstract

Meta-Reinforcement Learning (Meta-RL) aims to acquire meta-knowledge for quick adaptation to diverse tasks. However, applying these policies in real-world environments presents a significant challenge in balancing rapid adaptability with adherence to environmental constraints. Our novel approach, Constraint Model Agnostic Meta Learning (C-MAML), merges meta learning with constrained optimization to address this challenge. C-MAML enables rapid and efficient task adaptation by incorporating task-specific constraints directly into its meta-algorithm framework during the training phase. This fusion results in safer initial parameters for learning new tasks. We demonstrate the effectiveness of C-MAML in simulated locomotion with wheeled robot tasks of varying complexity, highlighting its practicality and robustness in dynamic environments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems