Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
Yun Qu, Yuhang Jiang, Boyuan Wang, Yixiu Mao, Cheems Wang, Chang Liu,, Xiangyang Ji

TL;DR
This paper introduces LaRe, a novel framework that leverages Large Language Models to improve credit assignment in episodic reinforcement learning by using a multi-dimensional latent reward concept for better interpretability and reward redistribution.
Contribution
It proposes a symbolic-based decision-making framework that utilizes LLM-generated semantic code and latent reward self-verification to enhance credit assignment in RL tasks.
Findings
LaRe achieves superior temporal credit assignment compared to SOTA methods.
It effectively allocates contributions among multiple agents.
Outperforms policies trained with ground truth rewards in certain tasks.
Abstract
Reinforcement learning (RL) often encounters delayed and sparse feedback in real-world applications, even with only episodic rewards. Previous approaches have made some progress in reward redistribution for credit assignment but still face challenges, including training difficulties due to redundancy and ambiguous attributions stemming from overlooking the multifaceted nature of mission performance evaluation. Hopefully, Large Language Model (LLM) encompasses fruitful decision-making knowledge and provides a plausible tool for reward redistribution. Even so, deploying LLM in this case is non-trivial due to the misalignment between linguistic knowledge and the symbolic form requirement, together with inherent randomness and hallucinations in inference. To tackle these issues, we introduce LaRe, a novel LLM-empowered symbolic-based decision-making framework, to improve credit assignment.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Digital Platforms and Economics · Reinforcement Learning in Robotics
