Sparse Reward Processes
Christos Dimitrakakis

TL;DR
This paper introduces a new class of learning problems called Sparse Reward Processes, modeling lifelong learning with task relations, and develops algorithms with theoretical performance bounds, emphasizing intrinsic motivation and curiosity.
Contribution
It formalizes Sparse Reward Processes as a decision-theoretic framework for lifelong learning, including algorithms and theoretical bounds, capturing task relations and intrinsic motivation.
Findings
Algorithms demonstrated effective learning in example domains.
Performance bounds established for certain problem instances.
Model captures the importance of curiosity in lifelong learning.
Abstract
We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is relation among those tasks, then the information gained during execution of one task has value for the execution of another task. Consequently, the agent is intrinsically motivated to explore its environment beyond the degree necessary to solve the current task it has at hand. We develop a decision theoretic setting that generalises standard reinforcement learning tasks and captures this intuition. More precisely, we consider a multi-stage stochastic game between a learning agent and an opponent. We posit that the setting is a good model for the problem of life-long learning in uncertain environments, where while resources must be spent learning about currently important tasks, there is also the need to allocate effort towards learning about aspects of the world which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Artificial Intelligence in Games
