Attention Trajectories as a Diagnostic Axis for Deep Reinforcement Learning
Charlotte Beylier, Hannah Selder, Arthur Fleig, Simon M. Hofmann, Nico Scherf

TL;DR
This paper introduces a methodology to analyze deep reinforcement learning agents by tracking their attention over time, revealing insights into their learning process, biases, and vulnerabilities beyond traditional performance metrics.
Contribution
The paper presents a novel hierarchical attention profiling method that captures attention trajectories, enabling diagnosis of biases, overfitting, and behavioral strategies in reinforcement learning agents.
Findings
Uncovered algorithm-specific attention biases
Revealed unintended reward-driven strategies
Diagnosed overfitting to redundant sensory channels
Abstract
While deep reinforcement learning agents demonstrate high performance across domains, their internal decision processes remain difficult to interpret when evaluated only through performance metrics. In particular, it is poorly understood which input features agents rely on, how these dependencies evolve during training, and how they relate to behavior. We introduce a scientific methodology for analyzing the learning process through quantitative analysis of saliency. This approach aggregates saliency information at the object and modality level into hierarchical attention profiles, quantifying how agents allocate attention over time, thereby forming attention trajectories throughout training. Applied to Atari benchmarks, custom Pong environments, and muscle-actuated biomechanical user simulations in visuomotor interactive tasks, this methodology uncovers algorithm-specific attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Visual Attention and Saliency Detection · Neural and Behavioral Psychology Studies
