Domains as Objectives: Domain-Uncertainty-Aware Policy Optimization through Explicit Multi-Domain Convex Coverage Set Learning
Wendyam Eric Lionel Ilboudo, Taisuke Kobayashi, Takamitsu Matsubara

TL;DR
This paper introduces a novel approach to optimize uncertainty-aware policies in robotics by framing the problem as a convex coverage set learning task within multi-objective reinforcement learning, improving policy robustness.
Contribution
It reformulates the challenge of optimizing uncertainty-aware policies as a convex coverage set problem in MORL, enabling more efficient training of policies that handle domain uncertainty.
Findings
MORL algorithms can effectively optimize uncertainty-aware policies.
The proposed framework improves policy robustness across domains.
Algorithms demonstrate enhanced performance in simulated experiments.
Abstract
The problem of uncertainty is a feature of real world robotics problems and any control framework must contend with it in order to succeed in real applications tasks. Reinforcement Learning is no different, and epistemic uncertainty arising from model uncertainty or misspecification is a challenge well captured by the sim-to-real gap. A simple solution to this issue is domain randomization (DR), which unfortunately can result in conservative agents. As a remedy to this conservativeness, the use of universal policies that take additional information about the randomized domain has risen as an alternative solution, along with recurrent neural network-based controllers. Uncertainty-aware universal policies present a particularly compelling solution able to account for system identification uncertainties during deployment. In this paper, we reveal that the challenge of efficiently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics
