Content-adaptive Encoder Preset Prediction for Adaptive Live Streaming
Vignesh V Menon, Hadi Amirpour, Prajit T Rajendran, Mohammad Ghanbari,, Christian Timmerer

TL;DR
This paper presents CAPS, a content-adaptive scheme that predicts optimal encoder presets for live streaming, improving quality and CPU utilization by analyzing video content and encoding parameters.
Contribution
It introduces a novel content-adaptive preset prediction method for live streaming that enhances quality and efficiency over fixed preset approaches.
Findings
Achieves 0.83 dB PSNR improvement at same bitrate
Yields 3.81 VMAF score increase
Maintains encoding speed while reducing CPU idle time
Abstract
In live streaming applications, a fixed set of bitrate-resolution pairs (known as bitrate ladder) is generally used to avoid additional pre-processing run-time to analyze the complexity of every video content and determine the optimized bitrate ladder. Furthermore, live encoders use the fastest available preset for encoding to ensure the minimum possible latency in streaming. For live encoders, it is expected that the encoding speed is equal to the video framerate. An optimized encoding preset may result in (i) increased Quality of Experience (QoE) and (ii) improved CPU utilization while encoding. In this light, this paper introduces a Content-Adaptive encoder Preset prediction Scheme (CAPS) for adaptive live video streaming applications. In this scheme, the encoder preset is determined using Discrete Cosine Transform (DCT)-energy-based low-complexity spatial and temporal features for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Video Coding and Compression Technologies · Advanced Image Processing Techniques
