A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning

Yuzheng Hu; Fan Wu; Haotian Ye; David Forsyth; James Zou; Nan Jiang; Jiaqi W. Ma; Han Zhao

arXiv:2505.19281·cs.LG·October 7, 2025

A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning

Yuzheng Hu, Fan Wu, Haotian Ye, David Forsyth, James Zou, Nan Jiang, Jiaqi W. Ma, Han Zhao

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a local data attribution framework for online reinforcement learning, enabling interpretability and targeted training interventions, demonstrated through improved efficiency and performance on various benchmarks.

Contribution

It develops a novel local attribution method for online RL, specifically for PPO, and proposes an iterative filtering algorithm to enhance training efficiency and outcomes.

Findings

01

Framework enables diagnosis of learning and behavior analysis.

02

IIF reduces sample complexity and speeds up training.

03

Higher returns achieved in benchmark tasks.

Abstract

Online reinforcement learning (RL) excels in complex, safety-critical domains but suffers from sample inefficiency, training instability, and limited interpretability. Data attribution provides a principled way to trace model behavior back to training samples, yet existing methods assume fixed datasets, which is violated in online RL where each experience both updates the policy and shapes future data collection. In this paper, we initiate the study of data attribution for online RL, focusing on the widely used Proximal Policy Optimization (PPO) algorithm. We start by establishing a \emph{local} attribution framework, interpreting model checkpoints with respect to the records in the recent training buffer. We design two target functions, capturing agent action and cumulative return respectively, and measure each record's contribution through gradient similarity between its training loss…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ldaorl/lda-orl
pytorchOfficial

Videos

A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning· slideslive

Taxonomy

TopicsOpen Source Software Innovations · Reinforcement Learning in Robotics