Adversarial Deep Learning for Online Resource Allocation

Bingqian Du; Zhiyi Huang; Chuan Wu

arXiv:2111.10285·cs.LG·November 22, 2021·1 cites

Adversarial Deep Learning for Online Resource Allocation

Bingqian Du, Zhiyi Huang, Chuan Wu

PDF

Open Access

TL;DR

This paper introduces a novel deep learning approach inspired by adversarial training to design online algorithms with minimized worst-case performance gaps, demonstrated through resource allocation problems.

Contribution

It is the first to use deep neural networks to develop online algorithms focusing on worst-case guarantees, employing a new per-round update method for better convergence.

Findings

01

The learned algorithm outperforms existing online algorithms in various settings.

02

The proposed method ensures convergence to Nash equilibrium.

03

Empirical results validate the effectiveness of the approach.

Abstract

Online algorithm is an important branch in algorithm design. Designing online algorithms with a bounded competitive ratio (in terms of worst-case performance) can be hard and usually relies on problem-specific assumptions. Inspired by adversarial training from Generative Adversarial Net (GAN) and the fact that competitive ratio of an online algorithm is based on worst-case input, we adopt deep neural networks to learn an online algorithm for a resource allocation and pricing problem from scratch, with the goal that the performance gap between offline optimum and the learned online algorithm can be minimized for worst-case input. Specifically, we leverage two neural networks as algorithm and adversary respectively and let them play a zero sum game, with the adversary being responsible for generating worst-case input while the algorithm learns the best strategy based on the input…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Optimization and Search Problems