Domains as Objectives: Domain-Uncertainty-Aware Policy Optimization   through Explicit Multi-Domain Convex Coverage Set Learning

Wendyam Eric Lionel Ilboudo; Taisuke Kobayashi; Takamitsu Matsubara

arXiv:2410.04719·cs.RO·October 8, 2024

Domains as Objectives: Domain-Uncertainty-Aware Policy Optimization through Explicit Multi-Domain Convex Coverage Set Learning

Wendyam Eric Lionel Ilboudo, Taisuke Kobayashi, Takamitsu Matsubara

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel approach to optimize uncertainty-aware policies in robotics by framing the problem as a convex coverage set learning task within multi-objective reinforcement learning, improving policy robustness.

Contribution

It reformulates the challenge of optimizing uncertainty-aware policies as a convex coverage set problem in MORL, enabling more efficient training of policies that handle domain uncertainty.

Findings

01

MORL algorithms can effectively optimize uncertainty-aware policies.

02

The proposed framework improves policy robustness across domains.

03

Algorithms demonstrate enhanced performance in simulated experiments.

Abstract

The problem of uncertainty is a feature of real world robotics problems and any control framework must contend with it in order to succeed in real applications tasks. Reinforcement Learning is no different, and epistemic uncertainty arising from model uncertainty or misspecification is a challenge well captured by the sim-to-real gap. A simple solution to this issue is domain randomization (DR), which unfortunately can result in conservative agents. As a remedy to this conservativeness, the use of universal policies that take additional information about the randomized domain has risen as an alternative solution, along with recurrent neural network-based controllers. Uncertainty-aware universal policies present a particularly compelling solution able to account for system identification uncertainties during deployment. In this paper, we reveal that the challenge of efficiently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mahoumaru/mdrl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics