Multi-objective Reinforcement Learning with Continuous Pareto Frontier   Approximation Supplementary Material

Matteo Pirotta; Simone Parisi; Marcello Restelli

arXiv:1406.3497·cs.AI·November 20, 2014·2 cites

Multi-objective Reinforcement Learning with Continuous Pareto Frontier Approximation Supplementary Material

Matteo Pirotta, Simone Parisi, Marcello Restelli

PDF

Open Access

TL;DR

This paper introduces a gradient-based method for approximating the continuous Pareto frontier in multi-objective reinforcement learning, enabling efficient generation of near-optimal solutions with a single optimization run.

Contribution

It proposes a novel policy-gradient approach that optimizes a manifold in policy space to approximate the Pareto frontier continuously, improving over previous multi-solution methods.

Findings

01

Effective approximation of Pareto frontiers demonstrated on MOMDPs

02

Single-gradient run approach reduces computational complexity

03

Method provides a continuous and improved Pareto frontier approximation

Abstract

This document contains supplementary material for the paper "Multi-objective Reinforcement Learning with Continuous Pareto Frontier Approximation", published at the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI-15). The paper is about learning a continuous approximation of the Pareto frontier in Multi-Objective Markov Decision Problems (MOMDPs). We propose a policy-based approach that exploits gradient information to generate solutions close to the Pareto ones. Differently from previous policy-gradient multi-objective algorithms, where n optimization routines are use to have n solutions, our approach performs a single gradient-ascent run that at each step generates an improved continuous approximation of the Pareto frontier. The idea is to exploit a gradient-based approach to optimize the parameters of a function that defines a manifold in the policy parameter space so…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Multi-Objective Optimization Algorithms · Reinforcement Learning in Robotics · Advanced Control Systems Optimization