Generalized Gaussian Temporal Difference Error for Uncertainty-aware   Reinforcement Learning

Seyeon Kim; Joonhun Lee; Namhoon Cho; Sungjun Han; Wooseop Hwang

arXiv:2408.02295·cs.LG·February 4, 2025

Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning

Seyeon Kim, Joonhun Lee, Namhoon Cho, Sungjun Han, Wooseop Hwang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a generalized Gaussian error modeling framework for uncertainty-aware reinforcement learning, improving the estimation of aleatoric and epistemic uncertainties and enhancing policy performance.

Contribution

It proposes a novel generalized Gaussian distribution approach for TD errors, incorporating kurtosis and shape parameters to better model uncertainties in deep RL.

Findings

01

Improved uncertainty estimation accuracy.

02

Enhanced policy performance in experiments.

03

Robust weighting scheme for uncertainty mitigation.

Abstract

Conventional uncertainty-aware temporal difference (TD) learning often assumes a zero-mean Gaussian distribution for TD errors, leading to inaccurate error representations and compromised uncertainty estimation. We introduce a novel framework for generalized Gaussian error modeling in deep reinforcement learning to enhance the flexibility of error distribution modeling by incorporating additional higher-order moment, particularly kurtosis, thereby improving the estimation and mitigation of data-dependent aleatoric uncertainty. We examine the influence of the shape parameter of the generalized Gaussian distribution (GGD) on aleatoric uncertainty and provide a closed-form expression that demonstrates an inverse relationship between uncertainty and the shape parameter. Additionally, we propose a theoretically grounded weighting scheme to address epistemic uncertainty by fully leveraging…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ait-lab/ggtde
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Multi-Objective Optimization Algorithms · Reinforcement Learning in Robotics