Attention Trajectories as a Diagnostic Axis for Deep Reinforcement Learning

Charlotte Beylier; Hannah Selder; Arthur Fleig; Simon M. Hofmann; Nico Scherf

arXiv:2511.20591·cs.LG·December 1, 2025

Attention Trajectories as a Diagnostic Axis for Deep Reinforcement Learning

Charlotte Beylier, Hannah Selder, Arthur Fleig, Simon M. Hofmann, Nico Scherf

PDF

Open Access

TL;DR

This paper introduces a methodology to analyze deep reinforcement learning agents by tracking their attention over time, revealing insights into their learning process, biases, and vulnerabilities beyond traditional performance metrics.

Contribution

The paper presents a novel hierarchical attention profiling method that captures attention trajectories, enabling diagnosis of biases, overfitting, and behavioral strategies in reinforcement learning agents.

Findings

01

Uncovered algorithm-specific attention biases

02

Revealed unintended reward-driven strategies

03

Diagnosed overfitting to redundant sensory channels

Abstract

While deep reinforcement learning agents demonstrate high performance across domains, their internal decision processes remain difficult to interpret when evaluated only through performance metrics. In particular, it is poorly understood which input features agents rely on, how these dependencies evolve during training, and how they relate to behavior. We introduce a scientific methodology for analyzing the learning process through quantitative analysis of saliency. This approach aggregates saliency information at the object and modality level into hierarchical attention profiles, quantifying how agents allocate attention over time, thereby forming attention trajectories throughout training. Applied to Atari benchmarks, custom Pong environments, and muscle-actuated biomechanical user simulations in visuomotor interactive tasks, this methodology uncovers algorithm-specific attention…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Visual Attention and Saliency Detection · Neural and Behavioral Psychology Studies