SCALED : Surrogate-gradient for Codec-Aware Learning of Downsampling in ABR Streaming

Esteban Pesnel (COMPACT); Julien Le Tanou; Michael Ropert; Thomas Maugey (COMPACT); Aline Roumy (COMPACT)

arXiv:2602.00198·eess.IV·February 3, 2026

SCALED : Surrogate-gradient for Codec-Aware Learning of Downsampling in ABR Streaming

Esteban Pesnel (COMPACT), Julien Le Tanou, Michael Ropert, Thomas Maugey (COMPACT), Aline Roumy (COMPACT)

PDF

TL;DR

This paper introduces SCALED, a surrogate-gradient framework that enables end-to-end learning of downsampling in ABR streaming using real codecs, leading to improved compression performance.

Contribution

It presents a novel surrogate-gradient method that allows training with actual non-differentiable codecs, bridging the gap between training and deployment.

Findings

01

Achieved 5.19% BD-BR (PSNR) improvement over codec-agnostic methods.

02

Demonstrated consistent gains across multiple downsampling ratios.

03

Validated effectiveness on real, non-differentiable codecs.

Abstract

The rapid growth in video consumption has introduced significant challenges to modern streaming architectures. Over-the-Top (OTT) video delivery now predominantly relies on Adaptive Bitrate (ABR) streaming, which dynamically adjusts bitrate and resolution based on client-side constraints such as display capabilities and network bandwidth. This pipeline typically involves downsampling the original high-resolution content, encoding and transmitting it, followed by decoding and upsampling on the client side. Traditionally, these processing stages have been optimized in isolation, leading to suboptimal end-to-end rate-distortion (R-D) performance. The advent of deep learning has spurred interest in jointly optimizing the ABR pipeline using learned resampling methods. However, training such systems end-to-end remains challenging due to the non-differentiable nature of standard video codecs,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.