DQ-Ladder: A Deep Reinforcement Learning-based Bitrate Ladder for Adaptive Video Streaming
Reza Farahani, Zoha Azimi, Vignesh V Menon, Hermann Hellwagner, Radu Prodan, Schahram Dustdar, Christian Timmerer

TL;DR
DQ-Ladder introduces a deep reinforcement learning approach to optimize bitrate ladders for adaptive video streaming, balancing quality, decoding time, and resolution dynamically based on content characteristics.
Contribution
It presents a novel DRL-based scheme that predicts decoding time and quality metrics to construct adaptive bitrate ladders without exhaustive encoding.
Findings
Achieves at least 10.3% BD-rate reduction in XPSNR compared to HLS ladder.
Reduces decoding time by 22%.
Remains robust with up to 20% prediction noise.
Abstract
Adaptive streaming of segmented video over HTTP typically relies on a predefined set of bitrate-resolution pairs, known as a bitrate ladder. However, fixed ladders often overlook variations in content and decoding complexities, leading to suboptimal trade-offs between encoding time, decoding efficiency, and video quality. This article introduces DQ-Ladder, a deep reinforcement learning (DRL)-based scheme for constructing time- and quality-aware bitrate ladders for adaptive video streaming applications. DQ-Ladder employs predicted decoding time, quality scores, and bitrate levels per segment as inputs to a Deep Q-Network (DQN) agent, guided by a weighted reward function of decoding time, video quality, and resolution smoothness. We leverage machine learning models to predict decoding time, bitrate level, and objective quality metrics (VMAF, XPSNR), eliminating the need for exhaustive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Video Coding and Compression Technologies · Network Traffic and Congestion Control
