On Assessing the Usefulness of Proxy Domains for Developing and Evaluating Embodied Agents
Anthony Courchesne (1, 2), Andrea Censi (3), Liam Paull (1 and, 2) ((1) Mila, (2) Universit\'e de Montr\'eal, (3) ETH Z\"urich)

TL;DR
This paper introduces new metrics to evaluate the usefulness of proxy domains in developing and assessing embodied agents, distinguishing their predictive accuracy from their utility in learning, especially when real-world data is hard to obtain.
Contribution
The paper proposes the proxy usefulness (PU) metrics, including relative predictive PU and learning PU, to better evaluate and optimize proxy domains for agent development.
Findings
New PU metrics effectively compare proxy domains.
Metrics help optimize proxy parameters for better agent performance.
Clarifies the role of proxy domains in simulation-based learning.
Abstract
In many situations it is either impossible or impractical to develop and evaluate agents entirely on the target domain on which they will be deployed. This is particularly true in robotics, where doing experiments on hardware is much more arduous than in simulation. This has become arguably more so in the case of learning-based agents. To this end, considerable recent effort has been devoted to developing increasingly realistic and higher fidelity simulators. However, we lack any principled way to evaluate how good a "proxy domain" is, specifically in terms of how useful it is in helping us achieve our end objective of building an agent that performs well in the target domain. In this work, we investigate methods to address this need. We begin by clearly separating two uses of proxy domains that are often conflated: 1) their ability to be a faithful predictor of agent performance and 2)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Online Learning and Analytics
