TL;DR
FDIM is a new universal video quality metric that combines deep and hand-crafted features to effectively assess both traditional and neural video codecs across SDR and HDR formats.
Contribution
The paper introduces FDIM, a hybrid feature-distance-based VQA metric that generalizes well across diverse codecs and formats, trained on a large-scale dataset.
Findings
FDIM achieves high correlation with subjective quality assessments.
It generalizes effectively across ten diverse SDR/HDR datasets.
FDIM outperforms existing VQA metrics in diverse codec scenarios.
Abstract
Video technology is advancing toward Ultra High Definition (UHD) and High Dynamic Range (HDR), which intensifies the need for higher compression efficiency for these high-specification videos. Beyond advances in traditional codecs, neural video codecs (NVCs) have attracted significant research attention and have evolved rapidly over the past few years. The coding artifacts of NVCs often exhibit content-varying and generative characteristics, which differ from those of conventional codecs and are challenging for traditional video quality assessment (VQA) methods to capture. Therefore, VQA metrics are required to generalize across different codecs, content types, and dynamic ranges to better support video codec research and evaluation. In this paper, we propose FDIM, a feature-distance-based generic video quality metric for both traditional and neural video codecs across SDR and HDR…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
