Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning

Dohyeong Kim; Mineui Hong; Jeongho Park; Songhwai Oh

arXiv:2403.00282 · cs.LG · June 3, 2024 · 1 citation

TL;DR

This paper introduces CoMOGA, a gradient aggregation method for constrained multi-objective reinforcement learning that avoids gradient conflicts, ensuring stable training and constraint satisfaction across tasks.

Contribution

The paper proposes a gradient aggregation approach for constrained multi-objective RL that handles safety constraints and comes with a convergence guarantee in tabular settings.

Findings

01 Prevents gradient conflicts in multi-objective RL.

02 Ensures constraint satisfaction in experiments.

03 Guarantees optimal convergence in tabular settings.

Abstract

In many real-world applications, a reinforcement learning (RL) agent should consider multiple objectives and adhere to safety guidelines. To address these considerations, we propose a constrained multi-objective RL algorithm named Constrained Multi-Objective Gradient Aggregator (CoMOGA). In the field of multi-objective optimization, managing conflicts between the gradients of the multiple objectives is crucial to prevent policies from converging to local optima. It is also essential to efficiently handle safety constraints for stable training and constraint satisfaction. We address these challenges straightforwardly by treating the maximization of multiple objectives as a constrained optimization problem (COP), where the constraints are defined to improve the original objectives. Existing safety constraints are then integrated into the COP, and the policy is updated using a linear…
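The abstract is cut off before it states the actual update rule, so the following is only a minimal sketch of the general idea it describes: turning "improve every objective" into linear constraints on the update direction and adding linearized safety constraints alongside them. It is not the paper's CoMOGA update. The function name `aggregate`, the thresholds `min_gains` and `cost_budgets`, and the choice of a minimum-norm direction solved by projected gradient ascent on the dual are all assumptions made for illustration.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def aggregate(obj_grads, min_gains, cost_grads=(), cost_budgets=(), iters=5000):
    """Hypothetical helper: smallest-norm direction d such that
    g_i . d >= min_gains[i] for every objective gradient g_i and
    b_j . d <= cost_budgets[j] for every safety-cost gradient b_j."""
    # Write every constraint in the common form  a . d >= e.
    rows = [list(g) for g in obj_grads] + [[-x for x in b] for b in cost_grads]
    thresh = list(min_gains) + [-c for c in cost_budgets]
    n = len(rows)
    gram = [[dot(r, s) for s in rows] for r in rows]
    # 1 / trace(gram) is a safe step size because trace >= largest eigenvalue.
    step = 1.0 / max(sum(gram[i][i] for i in range(n)), 1e-12)
    lam = [0.0] * n  # dual variables, one per constraint, kept non-negative
    for _ in range(iters):
        grad = [thresh[i] - dot(gram[i], lam) for i in range(n)]
        lam = [max(0.0, l + step * g) for l, g in zip(lam, grad)]
    # Recover the primal direction d = sum_i lam_i * a_i.
    return [sum(lam[i] * rows[i][k] for i in range(n)) for k in range(len(rows[0]))]


# Two conflicting objective gradients: their inner product is negative.
g1, g2 = [1.0, 0.0], [-2.0, 1.0]
naive = [a + b for a, b in zip(g1, g2)]  # plain sum; dot(g1, naive) < 0

# One linearized safety constraint: b . d <= 1.5 (made-up numbers).
b, budget = [1.0, 0.0], 1.5
d = aggregate([g1, g2], [1.0, 1.0], [b], [budget])  # approximately [1.0, 3.0]
```

In this toy case the summed gradient moves against the first objective, while the constrained direction has a positive inner product with both objective gradients and stays within the safety budget. If the constraints cannot all be met at once, this sketch does not detect that; a real implementation would need a recovery step for that case.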

Videos

Conflict-Averse Gradient Aggregation for Constrained Multi-Objective Reinforcement Learning · SlidesLive

Taxonomy

Topics: Smart Parking Systems Research · Reinforcement Learning in Robotics · Optimization and Variational Analysis