The Value of Information When Deciding What to Learn

Dilip Arumugam; Benjamin Van Roy

arXiv:2110.13973·cs.LG·October 28, 2021

The Value of Information When Deciding What to Learn

Dilip Arumugam, Benjamin Van Roy

PDF

Open Access 1 Video

TL;DR

This paper explores how to optimally select learning targets and acquire information efficiently in sequential decision-making, improving upon existing methods by coupling target design with information acquisition.

Contribution

It introduces a novel approach that combines optimal information acquisition with the design of learning targets, building on information-directed sampling and rate-distortion theory.

Findings

01

Empirical results demonstrate the value of information in learning target selection.

02

The proposed method improves efficiency in information acquisition.

03

Insights connect rate-distortion theory with learning target design.

Abstract

All sequential decision-making agents explore so as to acquire knowledge about a particular target. It is often the responsibility of the agent designer to construct this target which, in rich and complex environments, constitutes a onerous burden; without full knowledge of the environment itself, a designer may forge a sub-optimal learning target that poorly balances the amount of information an agent must acquire to identify the target against the target's associated performance shortfall. While recent work has developed a connection between learning targets and rate-distortion theory to address this challenge and empower agents that decide what to learn in an automated fashion, the proposed algorithm does not optimally tackle the equally important challenge of efficient information acquisition. In this work, building upon the seminal design principle of information-directed sampling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

The Value of Information When Deciding What to Learn· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Reinforcement Learning in Robotics