Online Network Source Optimization with Graph-Kernel MAB

Laura Toni; Pascal Frossard

arXiv:2307.03641·cs.LG·July 10, 2023

Online Network Source Optimization with Graph-Kernel MAB

Laura Toni, Pascal Frossard

PDF

Open Access

TL;DR

This paper introduces Grab-UCB, an online graph-kernel bandit algorithm for optimal source placement in large networks, leveraging spectral representations for sample efficiency and outperforming offline methods in simulations.

Contribution

The paper presents Grab-UCB, a novel online learning algorithm that uses adaptive graph dictionaries to efficiently optimize source placement in large networks.

Findings

01

Grab-UCB outperforms baseline offline methods in simulations.

02

The learning rate scales with spectral representation dimension, not network size.

03

Grab-arm-Light simplifies computation along the polytope edges.

Abstract

We propose Grab-UCB, a graph-kernel multi-arms bandit algorithm to learn online the optimal source placement in large scale networks, such that the reward obtained from a priori unknown network processes is maximized. The uncertainty calls for online learning, which suffers however from the curse of dimensionality. To achieve sample efficiency, we describe the network processes with an adaptive graph dictionary model, which typically leads to sparse spectral representations. This enables a data-efficient learning framework, whose learning rate scales with the dimension of the spectral representation model instead of the one of the network. We then propose Grab-UCB, an online sequential decision strategy that learns the parameters of the spectral representation while optimizing the action strategy. We derive the performance guarantees that depend on network parameters, which further…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Machine Learning and ELM