MO-Playground: Massively Parallelized Multi-Objective Reinforcement Learning for Robotics

Neil Janwani; Ellen Novoseller; Vernon J. Lawhern; Maegan Tucker

arXiv:2603.09237·cs.RO·March 11, 2026

MO-Playground: Massively Parallelized Multi-Objective Reinforcement Learning for Robotics

Neil Janwani, Ellen Novoseller, Vernon J. Lawhern, Maegan Tucker

PDF

Open Access

TL;DR

This paper introduces MORLAX and MO-Playground, enabling massively parallelized multi-objective reinforcement learning for robotics, significantly reducing computation time and improving Pareto front quality in complex robotic tasks.

Contribution

The paper presents a GPU-native MORL algorithm and a GPU-accelerated environment suite, enabling rapid approximation of Pareto sets in multi-objective robotics.

Findings

01

Achieved 25-270x speed-ups over CPU methods.

02

Successfully learned Pareto-optimal locomotion policies for BRUCE robot.

03

Demonstrated versatility across multiple realistic objectives.

Abstract

Multi-objective reinforcement learning (MORL) is a powerful tool to learn Pareto-optimal policy families across conflicting objectives. However, unlike traditional RL algorithms, existing MORL algorithms do not effectively leverage large-scale parallelization to concurrently simulate thousands of environments, resulting in vastly increased computation time. Ultimately, this has limited MORL's application towards complex multi-objective robotics problems. To address these challenges, we present 1) MORLAX, a new GPU-native, fast MORL algorithm, and 2) MO-Playground, a pip-installable playground of GPU-accelerated multi-objective environments. Together, MORLAX and MO-Playground approximate Pareto sets within minutes, offering 25-270x speed-ups compared to legacy CPU-based approaches whilst achieving superior Pareto front hypervolumes. We demonstrate the versatility of our approach by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Multi-Objective Optimization Algorithms · Robot Manipulation and Learning