Generalized Policy Gradient with History-Aware Decision Transformer for Reliable Routing over Graph Signals

Xing Wei; Yuanhang Wang; Duoxiang Zhao; Zezhou Zhang; Hao Qin; Yuqi Ouyang

arXiv:2508.17218·cs.LG·May 18, 2026

Generalized Policy Gradient with History-Aware Decision Transformer for Reliable Routing over Graph Signals

Xing Wei, Yuanhang Wang, Duoxiang Zhao, Zezhou Zhang, Hao Qin, Yuqi Ouyang

PDF

TL;DR

This paper introduces GPG-HT, a history-aware decision transformer framework that improves reliable routing in stochastic transportation networks by capturing complex spatial-temporal dependencies.

Contribution

It presents a novel integration of Decision Transformer with generalized policy gradient optimization for history-aware, context-sensitive path planning under uncertainty.

Findings

01

GPG-HT outperforms existing methods in on-time arrival probability.

02

Experiments on Sioux Falls and Anaheim networks validate the approach.

03

The framework effectively captures non-Markovian spatial-temporal dependencies.

Abstract

Reliable path planning in stochastic transportation networks requires decisions that account for uncertain and correlated travel times on irregular road graphs, rather than only minimizing expected delay. Such networks exhibit strong spatial-temporal coupling, where link travel times evolve as stochastic processes over graph edges, making the problem inherently sequential under uncertainty. Existing stochastic on-time arrival (SOTA) methods primarily depend on the current node and remaining budget, which limits their ability to exploit trajectory-level temporal structure and history-dependent correlations. This work proposes GPG-HT, a history-aware graph-signal policy framework that integrates a Decision Transformer with generalized policy gradient optimization for reliable routing. By attending to historical node-edge-time observations, GPG-HT captures non-Markovian spatial-temporal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.