Generalized Policy Gradient with History-Aware Decision Transformer for Reliable Routing over Graph Signals
Xing Wei, Yuanhang Wang, Duoxiang Zhao, Zezhou Zhang, Hao Qin, Yuqi Ouyang

TL;DR
This paper introduces GPG-HT, a history-aware decision transformer framework that improves reliable routing in stochastic transportation networks by capturing complex spatial-temporal dependencies.
Contribution
It presents a novel integration of Decision Transformer with generalized policy gradient optimization for history-aware, context-sensitive path planning under uncertainty.
Findings
GPG-HT outperforms existing methods in on-time arrival probability.
Experiments on Sioux Falls and Anaheim networks validate the approach.
The framework effectively captures non-Markovian spatial-temporal dependencies.
Abstract
Reliable path planning in stochastic transportation networks requires decisions that account for uncertain and correlated travel times on irregular road graphs, rather than only minimizing expected delay. Such networks exhibit strong spatial-temporal coupling, where link travel times evolve as stochastic processes over graph edges, making the problem inherently sequential under uncertainty. Existing stochastic on-time arrival (SOTA) methods primarily depend on the current node and remaining budget, which limits their ability to exploit trajectory-level temporal structure and history-dependent correlations. This work proposes GPG-HT, a history-aware graph-signal policy framework that integrates a Decision Transformer with generalized policy gradient optimization for reliable routing. By attending to historical node-edge-time observations, GPG-HT captures non-Markovian spatial-temporal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
