Agent-State Construction with Auxiliary Inputs
Ruo Yu Tao, Adam White, Marlos C. Machado

TL;DR
This paper investigates the role of auxiliary inputs in reinforcement learning, demonstrating how they enhance agent state construction, improve discrimination of observations, and complement existing methods for better decision-making in complex environments.
Contribution
It introduces a systematic exploration of auxiliary inputs in RL, relating them to classic state construction methods and showing their benefits for longer temporal credit assignment.
Findings
Auxiliary inputs help discriminate aliased observations.
They enable more expressive and interpolated state features.
Complement existing recurrent methods to improve performance.
Abstract
In many, if not every realistic sequential decision-making task, the decision-making agent is not able to model the full complexity of the world. The environment is often much larger and more complex than the agent, a setting also known as partial observability. In such settings, the agent must leverage more than just the current sensory inputs; it must construct an agent state that summarizes previous interactions with the world. Currently, a popular approach for tackling this problem is to learn the agent-state function via a recurrent network from the agent's sensory stream as input. Many impressive reinforcement learning applications have instead relied on environment-specific functions to aid the agent's inputs for history summarization. These augmentations are done in multiple ways, from simple approaches like concatenating observations to more complex ones such as uncertainty…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Explainable Artificial Intelligence (XAI)
