Online Reinforcement Learning for Dynamic Multimedia Systems

Nicholas Mastronarde; Mihaela van der Schaar

arXiv:0906.5325·cs.LG·June 6, 2013

Online Reinforcement Learning for Dynamic Multimedia Systems

Nicholas Mastronarde, Mihaela van der Schaar

PDF

TL;DR

This paper introduces online reinforcement learning algorithms for dynamic multimedia systems, enabling layers to learn optimal long-term performance strategies in real-time, with improved speed and autonomy over existing methods.

Contribution

The paper presents novel centralized and decentralized reinforcement learning algorithms tailored for multimedia systems, incorporating accelerated learning with partial system knowledge.

Findings

01

Decentralized learning matches centralized performance.

02

Proposed algorithms outperform existing application-independent methods.

03

Accelerated learning significantly speeds up convergence.

Abstract

In our previous work, we proposed a systematic cross-layer framework for dynamic multimedia systems, which allows each layer to make autonomous and foresighted decisions that maximize the system's long-term performance, while meeting the application's real-time delay constraints. The proposed solution solved the cross-layer optimization offline, under the assumption that the multimedia system's probabilistic dynamics were known a priori. In practice, however, these dynamics are unknown a priori and therefore must be learned online. In this paper, we address this problem by allowing the multimedia system layers to learn, through repeated interactions with each other, to autonomously optimize the system's long-term performance at run-time. We propose two reinforcement learning algorithms for optimizing the system under different design constraints: the first algorithm solves the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.