Hyperbolic Discounting and Learning over Multiple Horizons

William Fedus; Carles Gelada; Yoshua Bengio; Marc G. Bellemare; Hugo Larochelle

arXiv:1902.06865 · stat.ML · March 1, 2019 · 52 cites

TL;DR

This paper shows that hyperbolic discounting can be approximated in reinforcement learning with familiar temporal-difference methods, and finds that learning value functions over multiple time horizons is an effective auxiliary task.

Contribution

It introduces a simple way to approximate hyperbolic discount functions in an RL agent and shows that multi-horizon value learning is useful as an auxiliary task, independent of hyperbolic discounting.

Findings

01

Hyperbolic discount functions can be approximated while still using familiar temporal-difference learning techniques.

02

Simultaneously learning value functions over multiple time horizons is an effective auxiliary task, independent of hyperbolic discounting.

03

Multi-horizon learning often outperforms standard RL agents.

Abstract

Reinforcement learning (RL) typically defines a discount factor as part of the Markov Decision Process. The discount factor values future rewards by an exponential scheme that leads to theoretical convergence guarantees of the Bellman equation. However, evidence from psychology, economics and neuroscience suggests that humans and animals instead have hyperbolic time-preferences. In this work we revisit the fundamentals of discounting in RL and bridge this disconnect by implementing an RL agent that acts via hyperbolic discounting. We demonstrate that a simple approach approximates hyperbolic discount functions while still using familiar temporal-difference learning techniques in RL. Additionally, and independent of hyperbolic discounting, we make a surprising discovery that simultaneously learning value functions over multiple time-horizons is an effective auxiliary task which often…
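The abstract does not spell out the "simple approach", so the following is only a minimal sketch of one way to approximate a hyperbolic discount with exponential ones. It assumes the hyperbolic discount has the form 1 / (1 + k t) and relies on the identity that the integral of γ^(k t) over γ in [0, 1] equals 1 / (1 + k t), so an average over a grid of exponential discounts γ^k approximates the hyperbolic one. The function names, the midpoint grid, and the parameter `n_gammas` are illustrative choices, not taken from the paper or its repository.

```python
import numpy as np


def hyperbolic_discount(t, k):
    """Exact hyperbolic discount 1 / (1 + k * t) (assumed functional form)."""
    return 1.0 / (1.0 + k * t)


def gamma_grid(n_gammas):
    """Midpoints of n_gammas equal-width cells on (0, 1) (illustrative grid)."""
    return (np.arange(n_gammas) + 0.5) / n_gammas


def approx_hyperbolic_discount(t, k, n_gammas=1000):
    """Midpoint Riemann sum of the integral of gamma**(k * t) over [0, 1]."""
    gammas = gamma_grid(n_gammas)
    return float(np.mean(gammas ** (k * t)))


def approx_hyperbolic_return(rewards, k, n_gammas=1000):
    """Average of exponentially discounted returns, one per discount gamma**k.

    Each row of `discounts` is an ordinary exponential schedule, so each
    per-gamma return is the kind of quantity standard TD methods estimate.
    """
    rewards = np.asarray(rewards, dtype=float)
    steps = np.arange(len(rewards))
    gammas = gamma_grid(n_gammas)
    discounts = (gammas[:, None] ** k) ** steps[None, :]  # shape (n_gammas, T)
    per_gamma_returns = discounts @ rewards               # shape (n_gammas,)
    return float(np.mean(per_gamma_returns))


if __name__ == "__main__":
    k = 0.1
    rewards = [1.0, 1.0, 1.0]
    exact = sum(r * hyperbolic_discount(t, k) for t, r in enumerate(rewards))
    print("exact hyperbolic return:", exact)
    print("multi-gamma approximation:", approx_hyperbolic_return(rewards, k))
```

In an agent, the per-γ returns would presumably be replaced by separately learned value estimates, one per horizon; the averaging step stays the same under that assumption.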

Code & Models

Repositories

google-research/google-research/tree/master/hyperbolic_discount (tf, Official)

Taxonomy

Topics: Mathematical and Theoretical Analysis · Computability, Logic, AI Algorithms · Artificial Intelligence in Games