UGAE: A Novel Approach to Non-exponential Discounting

Ariel Kwiatkowski; Vicky Kalogeiton; Julien Pettr\'e; Marie-Paule Cani

arXiv:2302.05740·cs.LG·February 14, 2023·1 cites

UGAE: A Novel Approach to Non-exponential Discounting

Ariel Kwiatkowski, Vicky Kalogeiton, Julien Pettr\'e, Marie-Paule Cani

PDF

Open Access

TL;DR

This paper introduces UGAE, a flexible advantage estimation method enabling the use of non-exponential discounting in reinforcement learning, improving agent performance and aligning more closely with human-like reward preferences.

Contribution

The paper proposes UGAE, a universal advantage estimation technique that supports arbitrary discounting, and introduces Beta-weighted discounting, enhancing flexibility in discounting choices for RL agents.

Findings

01

Agents with non-exponential discounting outperform Monte Carlo baselines.

02

UGAE with Beta-weighted discounting achieves superior results on RL benchmarks.

03

The method is simple to integrate into existing advantage-based algorithms.

Abstract

The discounting mechanism in Reinforcement Learning determines the relative importance of future and present rewards. While exponential discounting is widely used in practice, non-exponential discounting methods that align with human behavior are often desirable for creating human-like agents. However, non-exponential discounting methods cannot be directly applied in modern on-policy actor-critic algorithms. To address this issue, we propose Universal Generalized Advantage Estimation (UGAE), which allows for the computation of GAE advantage values with arbitrary discounting. Additionally, we introduce Beta-weighted discounting, a continuous interpolation between exponential and hyperbolic discounting, to increase flexibility in choosing a discounting method. To showcase the utility of UGAE, we provide an analysis of the properties of various discounting methods. We also show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsClimate Change Policy and Economics · Auction Theory and Applications · Energy Efficiency and Management

MethodsALIGN