Retrospective In-Context Learning for Temporal Credit Assignment with Large Language Models

Wen-Tse Chen; Jiayu Chen; Fahim Tajwar; Hao Zhu; Xintong Duan; Ruslan Salakhutdinov; Jeff Schneider

arXiv:2602.17497·cs.LG·February 20, 2026

Retrospective In-Context Learning for Temporal Credit Assignment with Large Language Models

Wen-Tse Chen, Jiayu Chen, Fahim Tajwar, Hao Zhu, Xintong Duan, Ruslan Salakhutdinov, Jeff Schneider

PDF

Open Access

TL;DR

This paper introduces RICOL, a framework that uses large language models to improve temporal credit assignment in reinforcement learning, achieving high sample efficiency and comparable performance to traditional methods.

Contribution

It proposes a novel retrospective in-context learning approach leveraging LLMs for dense reward estimation, enhancing sample efficiency in RL.

Findings

01

RICL accurately estimates the advantage function with limited samples.

02

RICOL achieves comparable performance to traditional RL algorithms.

03

The approach improves sample efficiency in four BabyAI scenarios.

Abstract

Learning from self-sampled data and sparse environmental feedback remains a fundamental challenge in training self-evolving agents. Temporal credit assignment mitigates this issue by transforming sparse feedback into dense supervision signals. However, previous approaches typically depend on learning task-specific value functions for credit assignment, which suffer from poor sample efficiency and limited generalization. In this work, we propose to leverage pretrained knowledge from large language models (LLMs) to transform sparse rewards into dense training signals (i.e., the advantage function) through retrospective in-context learning (RICL). We further propose an online learning framework, RICOL, which iteratively refines the policy based on the credit assignment results from RICL. We empirically demonstrate that RICL can accurately estimate the advantage function with limited…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics · Topic Modeling