Video Coding for Machines with Feature-Based Rate-Distortion Optimization
Kristian Fischer, Fabian Brand, Christian Herglotz, Andr\'e Kaup

TL;DR
This paper introduces a feature-based rate-distortion optimization method for video coding that improves compression efficiency for machine analysis tasks by replacing pixel-based metrics with neural network feature space metrics.
Contribution
It proposes a standard-compliant feature-based RDO that enhances coding performance for neural network analysis, outperforming conventional RDO in bitrate savings and quality.
Findings
Up to 5.49% bitrate reduction with HFRDO compared to VTM-8.0.
Coding gains of up to 9.95% by varying the quantization parameter.
Effective for neural network-based video analysis scenarios.
Abstract
Common state-of-the-art video codecs are optimized to deliver a low bitrate by providing a certain quality for the final human observer, which is achieved by rate-distortion optimization (RDO). But, with the steady improvement of neural networks solving computer vision tasks, more and more multimedia data is not observed by humans anymore, but directly analyzed by neural networks. In this paper, we propose a standard-compliant feature-based RDO (FRDO) that is designed to increase the coding performance, when the decoded frame is analyzed by a neural network in a video coding for machine scenario. To that extent, we replace the pixel-based distortion metrics in conventional RDO of VTM-8.0 with distortion metrics calculated in the feature space created by the first layers of a neural network. Throughout several tests with the segmentation network Mask R-CNN and single images from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRegion Proposal Network · RoIAlign · Softmax · Convolution · Mask R-CNN
