Wrapper-Aware Rate-Distortion Optimization in Feature Coding for Machines
Samuel Fern\'andez-Mendui\~na, Hyomin Choi, Fabien Racap\'e, Eduardo Pavez, and Antonio Ortega

TL;DR
This paper introduces a wrapper-aware rate-distortion optimization method for feature coding in machine inference, improving compression efficiency by considering post-processing effects during bit allocation.
Contribution
It proposes a novel wrapper-aware weighted SSE metric and practical techniques for RDO, enhancing feature coding for neural network split-inference.
Findings
Matches state-of-the-art in FCM under MPEG conditions
Bridges codec generation gap with minimal overhead
Effective rate-distortion optimization considering wrappers
Abstract
Feature coding for machines (FCM) is a lossy compression paradigm for split-inference. The transmitter encodes the outputs of the first part of a neural network before sending them to the receiver for completing the inference. Practical FCM methods ``sandwich'' a traditional codec between pre- and post-processing neural networks, called wrappers, to make features easier to compress using video codecs. Since traditional codecs are non-differentiable, the wrappers are trained using a proxy codec, which is later replaced by a standard codec after training. These codecs perform rate-distortion optimization (RDO) based on the sum of squared errors (SSE). Because the RDO does not consider the post-processing wrapper, the inner codec can invest bits in preserving information that the post-processing later discards. In this paper, we modify the bit-allocation in the inner codec via a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Coding and Compression Technologies · Advanced Data Compression Techniques · Generative Adversarial Networks and Image Synthesis
