The Paradox of Choice: Using Attention in Hierarchical Reinforcement   Learning

Andrei Nica; Khimya Khetarpal; Doina Precup

arXiv:2201.09653·cs.LG·January 25, 2022

The Paradox of Choice: Using Attention in Hierarchical Reinforcement Learning

Andrei Nica, Khimya Khetarpal, Doina Precup

PDF

Open Access 1 Repo

TL;DR

This paper explores how limiting an AI agent's attention to meaningful choices via affordances can improve hierarchical reinforcement learning by addressing the paradox of choice, leading to faster and better decision-making.

Contribution

It introduces an online, model-free method to learn affordances that restrict available options, enhancing hierarchical RL in complex, long-horizon tasks.

Findings

01

Fewer, meaningful choices can accelerate learning.

02

Hard versus soft attention impacts data collection and learning.

03

The paradox of choice affects RL performance.

Abstract

Decision-making AI agents are often faced with two important challenges: the depth of the planning horizon, and the branching factor due to having many choices. Hierarchical reinforcement learning methods aim to solve the first problem, by providing shortcuts that skip over multiple time steps. To cope with the breadth, it is desirable to restrict the agent's attention at each step to a reasonable number of possible choices. The concept of affordances (Gibson, 1977) suggests that only certain actions are feasible in certain states. In this work, we model "affordances" through an attention mechanism that limits the available choices of temporally extended options. We present an online, model-free algorithm to learn affordances that can be used to further learn subgoal options. We investigate the role of hard versus soft attention in training data collection, abstract value learning in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

andreicnica/hrl_attention
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuction Theory and Applications · Complex Systems and Decision Making

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings