Strategically Linked Decisions in Long-Term Planning and Reinforcement Learning

Alihan H\"uy\"uk; Finale Doshi-Velez

arXiv:2505.16833·cs.LG·May 23, 2025

Strategically Linked Decisions in Long-Term Planning and Reinforcement Learning

Alihan H\"uy\"uk, Finale Doshi-Velez

PDF

Open Access

TL;DR

This paper introduces strategic link scores to quantify dependencies between decisions in long-term planning, enhancing understanding and performance in reinforcement learning, decision support, and traffic simulation.

Contribution

It proposes a novel metric, strategic link scores, to analyze decision dependencies, with applications in explaining RL agents, improving decision support, and characterizing planning horizons.

Findings

01

Strategic link scores reveal decision dependencies in RL and real-world scenarios.

02

Applying these scores improves explanation and performance of decision-making systems.

03

Analysis of traffic routing demonstrates the method's ability to characterize planning horizons.

Abstract

Long-term planning, as in reinforcement learning (RL), involves finding strategies: actions that collectively work toward a goal rather than individually optimizing their immediate outcomes. As part of a strategy, some actions are taken at the expense of short-term benefit to enable future actions with even greater returns. These actions are only advantageous if followed up by the actions they facilitate, consequently, they would not have been taken if those follow-ups were not available. In this paper, we quantify such dependencies between planned actions with strategic link scores: the drop in the likelihood of one decision under the constraint that a follow-up decision is no longer available. We demonstrate the utility of strategic link scores through three practical applications: (i) explaining black-box RL agents by identifying strategically linked pairs among decisions they make,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Traffic control and management · Autonomous Vehicle Technology and Safety