Beyond Bayes-optimality: meta-learning what you know you don't know

Jordi Grau-Moya; Gr\'egoire Del\'etang; Markus Kunesch; Tim Genewein,; Elliot Catt; Kevin Li; Anian Ruoss; Chris Cundy; Joel Veness; Jane Wang,; Marcus Hutter; Christopher Summerfield; Shane Legg; Pedro Ortega

arXiv:2209.15618·cs.AI·October 13, 2022

Beyond Bayes-optimality: meta-learning what you know you don't know

Jordi Grau-Moya, Gr\'egoire Del\'etang, Markus Kunesch, Tim Genewein,, Elliot Catt, Kevin Li, Anian Ruoss, Chris Cundy, Joel Veness, Jane Wang,, Marcus Hutter, Christopher Summerfield, Shane Legg, Pedro Ortega

PDF

Open Access

TL;DR

This paper extends meta-learning agents beyond Bayes-optimality by developing algorithms that produce risk- and ambiguity-sensitive behaviors, aligning more closely with human decision-making in safety-critical contexts.

Contribution

It introduces modified meta-training algorithms that generate risk- and ambiguity-sensitive agents through altered experience-generation processes.

Findings

01

Agents exhibit risk sensitivity in decision-making tasks.

02

Agents demonstrate ambiguity sensitivity in novel situations.

03

Modified algorithms successfully produce non-Bayes-optimal behaviors.

Abstract

Meta-training agents with memory has been shown to culminate in Bayes-optimal agents, which casts Bayes-optimality as the implicit solution to a numerical optimization problem rather than an explicit modeling assumption. Bayes-optimal agents are risk-neutral, since they solely attune to the expected return, and ambiguity-neutral, since they act in new situations as if the uncertainty were known. This is in contrast to risk-sensitive agents, which additionally exploit the higher-order moments of the return, and ambiguity-sensitive agents, which act differently when recognizing situations in which they lack knowledge. Humans are also known to be averse to ambiguity and sensitive to risk in ways that aren't Bayes-optimal, indicating that such sensitivity can confer advantages, especially in safety-critical situations. How can we extend the meta-learning protocol to generate risk- and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Adversarial Robustness in Machine Learning

MethodsTest