The Paradox of Choice: Using Attention in Hierarchical Reinforcement Learning
Andrei Nica, Khimya Khetarpal, Doina Precup

TL;DR
This paper explores how limiting an AI agent's attention to meaningful choices via affordances can improve hierarchical reinforcement learning by addressing the paradox of choice, leading to faster and better decision-making.
Contribution
It introduces an online, model-free method to learn affordances that restrict available options, enhancing hierarchical RL in complex, long-horizon tasks.
Findings
Fewer, meaningful choices can accelerate learning.
Hard versus soft attention impacts data collection and learning.
The paradox of choice affects RL performance.
Abstract
Decision-making AI agents are often faced with two important challenges: the depth of the planning horizon, and the branching factor due to having many choices. Hierarchical reinforcement learning methods aim to solve the first problem, by providing shortcuts that skip over multiple time steps. To cope with the breadth, it is desirable to restrict the agent's attention at each step to a reasonable number of possible choices. The concept of affordances (Gibson, 1977) suggests that only certain actions are feasible in certain states. In this work, we model "affordances" through an attention mechanism that limits the available choices of temporally extended options. We present an online, model-free algorithm to learn affordances that can be used to further learn subgoal options. We investigate the role of hard versus soft attention in training data collection, abstract value learning in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuction Theory and Applications · Complex Systems and Decision Making
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
