Efficient Bitrate Ladder Construction for Content-Optimized Adaptive Video Streaming
Angeliki V. Katsenou, Joel Sole, David R. Bull

TL;DR
This paper introduces a machine learning-based approach to efficiently construct content-optimized bitrate ladders for adaptive video streaming, significantly reducing encoding costs while maintaining high quality.
Contribution
It proposes a novel method that predicts content-specific bitrate ladders using spatio-temporal features, reducing encoding efforts compared to exhaustive search methods.
Findings
Achieves up to 89.06% reduction in encodings compared to exhaustive search.
Maintains a low average rate difference of 1.78% with the exhaustive approach.
Hybrid method further reduces encodings by 83.83% with minimal quality loss.
Abstract
One of the challenges faced by many video providers is the heterogeneity of network specifications, user requirements, and content compression performance. The universal solution of a fixed bitrate ladder is inadequate in ensuring a high quality of user experience without re-buffering or introducing annoying compression artifacts. However, a content-tailored solution, based on extensively encoding across all resolutions and over a wide quality range is highly expensive in terms of computational, financial, and energy costs. Inspired by this, we propose an approach that exploits machine learning to predict a content-optimized bitrate ladder. The method extracts spatio-temporal features from the uncompressed content, trains machine-learning models to predict the Pareto front parameters, and, based on that, builds the ladder within a defined bitrate range. The method has the benefit of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
