SCALED : Surrogate-gradient for Codec-Aware Learning of Downsampling in ABR Streaming
Esteban Pesnel (COMPACT), Julien Le Tanou, Michael Ropert, Thomas Maugey (COMPACT), Aline Roumy (COMPACT)

TL;DR
This paper introduces SCALED, a surrogate-gradient framework that enables end-to-end learning of downsampling in ABR streaming using real codecs, leading to improved compression performance.
Contribution
It presents a novel surrogate-gradient method that allows training with actual non-differentiable codecs, bridging the gap between training and deployment.
Findings
Achieved 5.19% BD-BR (PSNR) improvement over codec-agnostic methods.
Demonstrated consistent gains across multiple downsampling ratios.
Validated effectiveness on real, non-differentiable codecs.
Abstract
The rapid growth in video consumption has introduced significant challenges to modern streaming architectures. Over-the-Top (OTT) video delivery now predominantly relies on Adaptive Bitrate (ABR) streaming, which dynamically adjusts bitrate and resolution based on client-side constraints such as display capabilities and network bandwidth. This pipeline typically involves downsampling the original high-resolution content, encoding and transmitting it, followed by decoding and upsampling on the client side. Traditionally, these processing stages have been optimized in isolation, leading to suboptimal end-to-end rate-distortion (R-D) performance. The advent of deep learning has spurred interest in jointly optimizing the ABR pipeline using learned resampling methods. However, training such systems end-to-end remains challenging due to the non-differentiable nature of standard video codecs,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
