PG-Flow: Deterministic implicit policy gradients for geometric product-form queueing networks
Youssef Ait El Mahjoub

TL;DR
PG-Flow introduces a novel deterministic policy-gradient method for optimizing steady-state performance in geometric product-form queueing networks, leveraging implicit differentiation for efficiency and convergence guarantees.
Contribution
It develops a new gradient framework that differentiates through steady-state equations, enabling efficient deterministic optimization in PFQNs.
Findings
Provides exact gradients via implicit differentiation.
Achieves linear-time computation in acyclic networks.
Demonstrates effectiveness in routing and energy control tasks.
Abstract
Product-form queueing networks (PFQNs) admit steady-state distributions that factorize into local terms, and in many classical PFQNs including Jackson, BCMP, G-networks, and Energy Packet Networks, these marginals are geometric and parametrized by local flow variables satisfying balance equations. While this structure yields closed-form expressions for key performance metrics, its use for deterministic steady-state optimization remains limited. We introduce PG-Flow, a deterministic policy-gradient framework that differentiates through the steady-state flow fixed-point equations, providing exact gradients via implicit differentiation and a local adjoint system while avoiding trajectory sampling and Poisson equations. We establish global convergence under structural assumptions (affine flow operators and convex local costs), and show that acyclic networks admit linear-time computation of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Queuing Theory Analysis · Age of Information Optimization · Advanced Wireless Network Optimization
