On the Choice of Perception Loss Function for Learned Video Compression
Sadaf Salehkalaibar, Buu Phan, Jun Chen, Wei Yu, Ashish Khisti

TL;DR
This paper investigates how different perception loss functions affect low-latency learned video compression, revealing trade-offs between temporal consistency and distortion, and proposing decoder-side choice of perception metrics for flexible reconstruction quality.
Contribution
It introduces a comparative analysis of two perception loss functions in learned video compression, showing that decoder-side selection can achieve near-universal performance.
Findings
PLF-JD preserves temporal correlation better but increases distortion.
PLF-FMD results in lower distortion but less temporal consistency.
Decoder-side choice of PLF can match the performance of trained systems without specific PLF optimization.
Abstract
We study causal, low-latency, sequential video compression when the output is subjected to both a mean squared-error (MSE) distortion loss as well as a perception loss to target realism. Motivated by prior approaches, we consider two different perception loss functions (PLFs). The first, PLF-JD, considers the joint distribution (JD) of all the video frames up to the current one, while the second metric, PLF-FMD, considers the framewise marginal distributions (FMD) between the source and reconstruction. Using information theoretic analysis and deep-learning based experiments, we demonstrate that the choice of PLF can have a significant effect on the reconstruction, especially at low-bit rates. In particular, while the reconstruction based on PLF-JD can better preserve the temporal correlation across frames, it also imposes a significant penalty in distortion compared to PLF-FMD and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Image Processing Techniques · Sparse and Compressive Sensing Techniques · Image and Signal Denoising Methods
