Near Optimal Adaptive Shortest Path Routing with Stochastic Links States under Adversarial Attack
Pan Zhou, Lin Cheng, Dapeng Oliver Wu

TL;DR
This paper introduces an adaptive shortest path routing algorithm that effectively handles stochastic and adversarial link states using a multi-armed bandit framework, achieving near-optimal performance in complex network environments.
Contribution
It formulates the adaptive routing problem as a combinatorial adversarial multi-armed bandit, introducing a novel control parameter and providing scalable algorithms with strong theoretical guarantees.
Findings
Achieves 65.3% reduction in network delay compared to existing methods.
Reduces learning duration by 81.5% under specified network delay.
Handles environment features automatically with near-optimal learning performance.
Abstract
We consider the shortest path routing (SPR) of a network with stochastically time varying link metrics under potential adversarial attacks. Due to potential denial of service attacks, the distributions of link states could be stochastic (benign) or adversarial at different temporal and spatial locations. Without any \emph{a priori}, designing an adaptive SPR protocol to cope with all possible situations in practice optimally is a very challenging issue. In this paper, we present the first solution by formulating it as a multi-armed bandit (MAB) problem. By introducing a novel control parameter into the exploration phase for each link, a martingale inequality is applied in the our combinatorial adversarial MAB framework. As such, our proposed algorithms could automatically detect features of the environment within a unified framework and find the optimal SPR strategies with almost…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Optimization and Search Problems
