Sparse Reward Processes

Christos Dimitrakakis

arXiv:1201.2555·cs.LG·September 6, 2012·1 cites

Sparse Reward Processes

Christos Dimitrakakis

PDF

Open Access

TL;DR

This paper introduces a new class of learning problems called Sparse Reward Processes, modeling lifelong learning with task relations, and develops algorithms with theoretical performance bounds, emphasizing intrinsic motivation and curiosity.

Contribution

It formalizes Sparse Reward Processes as a decision-theoretic framework for lifelong learning, including algorithms and theoretical bounds, capturing task relations and intrinsic motivation.

Findings

01

Algorithms demonstrated effective learning in example domains.

02

Performance bounds established for certain problem instances.

03

Model captures the importance of curiosity in lifelong learning.

Abstract

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is relation among those tasks, then the information gained during execution of one task has value for the execution of another task. Consequently, the agent is intrinsically motivated to explore its environment beyond the degree necessary to solve the current task it has at hand. We develop a decision theoretic setting that generalises standard reinforcement learning tasks and captures this intuition. More precisely, we consider a multi-stage stochastic game between a learning agent and an opponent. We posit that the setting is a good model for the problem of life-long learning in uncertain environments, where while resources must be spent learning about currently important tasks, there is also the need to allocate effort towards learning about aspects of the world which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Artificial Intelligence in Games