Extending Environments To Measure Self-Reflection In Reinforcement   Learning

Samuel Allen Alexander; Michael Castaneda; Kevin Compher; Oscar; Martinez

arXiv:2110.06890·cs.AI·July 21, 2022

Extending Environments To Measure Self-Reflection In Reinforcement Learning

Samuel Allen Alexander, Michael Castaneda, Kevin Compher, Oscar, Martinez

PDF

Open Access 1 Repo

TL;DR

This paper introduces an extended environment framework for reinforcement learning that emphasizes the importance of self-reflection in agents, proposing a new way to measure and enhance this trait through environment design.

Contribution

It extends traditional RL environments to include agent simulation, proposing a new measure of self-reflection and demonstrating a transformation that improves agent performance in these environments.

Findings

01

Extended environments can be used to measure self-reflection in RL agents.

02

A simple transformation can improve agent performance in extended environments.

03

Self-reflection may be crucial for achieving better performance in complex RL settings.

Abstract

We consider an extended notion of reinforcement learning in which the environment can simulate the agent and base its outputs on the agent's hypothetical behavior. Since good performance usually requires paying attention to whatever things the environment's outputs are based on, we argue that for an agent to achieve on-average good performance across many such extended environments, it is necessary for the agent to self-reflect. Thus weighted-average performance over the space of all suitably well-behaved extended environments could be considered a way of measuring how self-reflective an agent is. We give examples of extended environments and introduce a simple transformation which experimentally seems to increase some standard RL agents' performance in a certain type of extended environment.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

semitrivial/extendedenvironments
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputability, Logic, AI Algorithms