Multi-objective Reinforcement Learning with Continuous Pareto Frontier Approximation Supplementary Material
Matteo Pirotta, Simone Parisi, Marcello Restelli

TL;DR
This paper introduces a gradient-based method for approximating the continuous Pareto frontier in multi-objective reinforcement learning, enabling efficient generation of near-optimal solutions with a single optimization run.
Contribution
It proposes a novel policy-gradient approach that optimizes a manifold in policy space to approximate the Pareto frontier continuously, improving over previous multi-solution methods.
Findings
Effective approximation of Pareto frontiers demonstrated on MOMDPs
Single-gradient run approach reduces computational complexity
Method provides a continuous and improved Pareto frontier approximation
Abstract
This document contains supplementary material for the paper "Multi-objective Reinforcement Learning with Continuous Pareto Frontier Approximation", published at the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI-15). The paper is about learning a continuous approximation of the Pareto frontier in Multi-Objective Markov Decision Problems (MOMDPs). We propose a policy-based approach that exploits gradient information to generate solutions close to the Pareto ones. Differently from previous policy-gradient multi-objective algorithms, where n optimization routines are use to have n solutions, our approach performs a single gradient-ascent run that at each step generates an improved continuous approximation of the Pareto frontier. The idea is to exploit a gradient-based approach to optimize the parameters of a function that defines a manifold in the policy parameter space so…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Multi-Objective Optimization Algorithms · Reinforcement Learning in Robotics · Advanced Control Systems Optimization
