Meta-Reinforcement Learning via Buffering Graph Signatures for Live Video Streaming Events
Stefanos Antaris, Dimitrios Rafailidis, Sarunas Girdzijauskas

TL;DR
This paper introduces MELANIE, a meta-learning model that uses graph signatures and prioritized replay buffers to adaptively predict network capacity during live video streaming events, improving accuracy and responsiveness.
Contribution
The paper proposes a novel meta-learning framework with graph signature buffers and prioritized replay for real-time network capacity prediction in live streaming.
Findings
Achieved an average 25% relative gain over state-of-the-art methods.
Effectively adapts to low structural similarity between different streaming events.
Demonstrated robustness across three real-world datasets.
Abstract
In this study, we present a meta-learning model to adapt the predictions of the network's capacity between viewers who participate in a live video streaming event. We propose the MELANIE model, where an event is formulated as a Markov Decision Process, performing meta-learning on reinforcement learning tasks. By considering a new event as a task, we design an actor-critic learning scheme to compute the optimal policy on estimating the viewers' high-bandwidth connections. To ensure fast adaptation to new connections or changes among viewers during an event, we implement a prioritized replay memory buffer based on the Kullback-Leibler divergence of the reward/throughput of the viewers' connections. Moreover, we adopt a model-agnostic meta-learning framework to generate a global model from past events. As viewers scarcely participate in several events, the challenge resides on how to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Online Learning and Analytics
