Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning

Yun Qu; Yuhang Jiang; Boyuan Wang; Yixiu Mao; Cheems Wang; Chang Liu; Xiangyang Ji

arXiv:2412.11120·cs.LG·January 10, 2025

TL;DR

This paper introduces LaRe, a novel framework that leverages Large Language Models to improve credit assignment in episodic reinforcement learning by using a multi-dimensional latent reward concept for better interpretability and reward redistribution.

Contribution

It proposes a symbolic-based decision-making framework that utilizes LLM-generated semantic code and latent reward self-verification to enhance credit assignment in RL tasks.

Findings

01 LaRe achieves superior temporal credit assignment compared to state-of-the-art (SOTA) methods.

02 It effectively allocates contributions among multiple agents.

03 It outperforms policies trained with ground-truth rewards in certain tasks.

Abstract

Reinforcement learning (RL) often encounters delayed and sparse feedback in real-world applications, in the extreme case with only episodic rewards. Previous approaches have made some progress in reward redistribution for credit assignment but still face challenges, including training difficulties due to redundancy and ambiguous attributions stemming from overlooking the multifaceted nature of mission performance evaluation. Fortunately, large language models (LLMs) encompass rich decision-making knowledge and provide a plausible tool for reward redistribution. Even so, deploying LLMs in this setting is non-trivial due to the misalignment between linguistic knowledge and the symbolic form requirement, together with inherent randomness and hallucinations in inference. To tackle these issues, we introduce LaRe, a novel LLM-empowered symbolic-based decision-making framework, to improve credit assignment.…
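The idea of reward redistribution mentioned in the abstract can be illustrated with a small sketch. It is not the paper's implementation: the truncated abstract does not specify how LaRe encodes or decodes latent rewards, so everything here is an assumption made for illustration. The hand-written `latent_reward` function stands in for code an LLM might generate, its two factors (progress and effort) are hypothetical, and a linear least-squares decoder is swapped in as the simplest way to turn an episodic return into per-step proxy rewards.

```python
import numpy as np

def latent_reward(state, action):
    """Hypothetical stand-in for an LLM-generated encoding function.

    Maps one (state, action) pair to a small vector of interpretable factors.
    """
    progress = state[0]            # assumed factor: forward progress
    effort = -float(action ** 2)   # assumed factor: control-cost penalty
    return np.array([progress, effort])

def fit_decoder(episodes, returns):
    """Fit linear weights w so that sum_t w . z_t approximates each episodic return."""
    # Summing the latent rewards over time gives one feature row per episode.
    features = np.stack([
        np.sum([latent_reward(s, a) for s, a in ep], axis=0) for ep in episodes
    ])
    targets = np.asarray(returns, dtype=float)
    w, *_ = np.linalg.lstsq(features, targets, rcond=None)
    return w

def redistribute(episode, w):
    """Return one proxy reward per step of a single episode."""
    return np.array([latent_reward(s, a) @ w for s, a in episode])

# Synthetic data: only the episodic return is observed, never per-step rewards.
rng = np.random.default_rng(0)
true_w = np.array([2.0, 0.5])      # assumed ground-truth factor weights
episodes = [
    [(rng.normal(size=3), rng.normal()) for _ in range(10)]
    for _ in range(20)
]
returns = [sum(latent_reward(s, a) @ true_w for s, a in ep) for ep in episodes]

w = fit_decoder(episodes, returns)
proxy = redistribute(episodes[0], w)
print("recovered weights:", w)
print("proxy rewards sum:", proxy.sum(), "episodic return:", returns[0])
```

Because the decoder only ever sees sums over whole episodes, the per-step values it produces are a redistribution of the episodic return rather than observed rewards. In this noiseless toy setting the proxy rewards of each episode add back up to its return.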

Taxonomy

Topics: Evolutionary Algorithms and Applications · Digital Platforms and Economics · Reinforcement Learning in Robotics