Adversarial Network Optimization under Bandit Feedback: Maximizing   Utility in Non-Stationary Multi-Hop Networks

Yan Dai; Longbo Huang

arXiv:2408.16215·math.OC·August 30, 2024

Adversarial Network Optimization under Bandit Feedback: Maximizing Utility in Non-Stationary Multi-Hop Networks

Yan Dai, Longbo Huang

PDF

Open Access

TL;DR

This paper introduces `UMO2`, a novel algorithm for adversarial network optimization in non-stationary multi-hop networks using bandit feedback, ensuring stability and near-optimal utility without prior network knowledge.

Contribution

The paper presents the first ANO algorithm capable of handling multi-hop networks and providing utility guarantees under bandit feedback, integrating online learning with Lyapunov methods.

Findings

01

Ensures network stability while maximizing utility.

02

Matches utility performance of mildly varying reference policies.

03

Introduces adaptive online learning algorithms for queue-dependent optimization.

Abstract

Stochastic Network Optimization (SNO) concerns scheduling in stochastic queueing systems. It has been widely studied in network theory. Classical SNO algorithms require network conditions to be stationary with time, which fails to capture the non-stationary components in many real-world scenarios. Many existing algorithms also assume knowledge of network conditions before decision, which rules out applications where unpredictability presents. Motivated by these issues, we consider Adversarial Network Optimization (ANO) under bandit feedback. Specifically, we consider the task of *i)* maximizing some unknown and time-varying utility function associated to scheduler's actions, where *ii)* the underlying network is a non-stationary multi-hop one whose conditions change arbitrarily with time, and *iii)* only bandit feedback (effect of actually deployed actions) is revealed after…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSecurity in Wireless Sensor Networks · Adversarial Robustness in Machine Learning · Sparse and Compressive Sensing Techniques