On Assessing the Usefulness of Proxy Domains for Developing and   Evaluating Embodied Agents

Anthony Courchesne (1; 2); Andrea Censi (3); Liam Paull (1 and; 2) ((1) Mila; (2) Universit\'e de Montr\'eal; (3) ETH Z\"urich)

arXiv:2109.14516·cs.LG·October 8, 2021

On Assessing the Usefulness of Proxy Domains for Developing and Evaluating Embodied Agents

Anthony Courchesne (1, 2), Andrea Censi (3), Liam Paull (1 and, 2) ((1) Mila, (2) Universit\'e de Montr\'eal, (3) ETH Z\"urich)

PDF

Open Access 1 Repo

TL;DR

This paper introduces new metrics to evaluate the usefulness of proxy domains in developing and assessing embodied agents, distinguishing their predictive accuracy from their utility in learning, especially when real-world data is hard to obtain.

Contribution

The paper proposes the proxy usefulness (PU) metrics, including relative predictive PU and learning PU, to better evaluate and optimize proxy domains for agent development.

Findings

01

New PU metrics effectively compare proxy domains.

02

Metrics help optimize proxy parameters for better agent performance.

03

Clarifies the role of proxy domains in simulation-based learning.

Abstract

In many situations it is either impossible or impractical to develop and evaluate agents entirely on the target domain on which they will be deployed. This is particularly true in robotics, where doing experiments on hardware is much more arduous than in simulation. This has become arguably more so in the case of learning-based agents. To this end, considerable recent effort has been devoted to developing increasingly realistic and higher fidelity simulators. However, we lack any principled way to evaluate how good a "proxy domain" is, specifically in terms of how useful it is in helping us achieve our end objective of building an agent that performs well in the target domain. In this work, we investigate methods to address this need. We begin by clearly separating two uses of proxy domains that are often conflated: 1) their ability to be a faithful predictor of agent performance and 2)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

duckietown/gym-duckietown
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Online Learning and Analytics