Adversarial Network Optimization under Bandit Feedback: Maximizing Utility in Non-Stationary Multi-Hop Networks
Yan Dai, Longbo Huang

TL;DR
This paper introduces `UMO2`, a novel algorithm for adversarial network optimization in non-stationary multi-hop networks using bandit feedback, ensuring stability and near-optimal utility without prior network knowledge.
Contribution
The paper presents the first ANO algorithm capable of handling multi-hop networks and providing utility guarantees under bandit feedback, integrating online learning with Lyapunov methods.
Findings
Ensures network stability while maximizing utility.
Matches utility performance of mildly varying reference policies.
Introduces adaptive online learning algorithms for queue-dependent optimization.
Abstract
Stochastic Network Optimization (SNO) concerns scheduling in stochastic queueing systems. It has been widely studied in network theory. Classical SNO algorithms require network conditions to be stationary with time, which fails to capture the non-stationary components in many real-world scenarios. Many existing algorithms also assume knowledge of network conditions before decision, which rules out applications where unpredictability presents. Motivated by these issues, we consider Adversarial Network Optimization (ANO) under bandit feedback. Specifically, we consider the task of *i)* maximizing some unknown and time-varying utility function associated to scheduler's actions, where *ii)* the underlying network is a non-stationary multi-hop one whose conditions change arbitrarily with time, and *iii)* only bandit feedback (effect of actually deployed actions) is revealed after…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSecurity in Wireless Sensor Networks · Adversarial Robustness in Machine Learning · Sparse and Compressive Sensing Techniques
