Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
Seyeon Kim, Joonhun Lee, Namhoon Cho, Sungjun Han, Wooseop Hwang

TL;DR
This paper introduces a generalized Gaussian error modeling framework for uncertainty-aware reinforcement learning, improving the estimation of aleatoric and epistemic uncertainties and enhancing policy performance.
Contribution
It proposes a novel generalized Gaussian distribution approach for TD errors, incorporating kurtosis and shape parameters to better model uncertainties in deep RL.
Findings
Improved uncertainty estimation accuracy.
Enhanced policy performance in experiments.
Robust weighting scheme for uncertainty mitigation.
Abstract
Conventional uncertainty-aware temporal difference (TD) learning often assumes a zero-mean Gaussian distribution for TD errors, leading to inaccurate error representations and compromised uncertainty estimation. We introduce a novel framework for generalized Gaussian error modeling in deep reinforcement learning to enhance the flexibility of error distribution modeling by incorporating additional higher-order moment, particularly kurtosis, thereby improving the estimation and mitigation of data-dependent aleatoric uncertainty. We examine the influence of the shape parameter of the generalized Gaussian distribution (GGD) on aleatoric uncertainty and provide a closed-form expression that demonstrates an inverse relationship between uncertainty and the shape parameter. Additionally, we propose a theoretically grounded weighting scheme to address epistemic uncertainty by fully leveraging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Multi-Objective Optimization Algorithms · Reinforcement Learning in Robotics
