A Mamba-based Perceptual Loss Function for Learning-based UGC Transcoding
Zihao Qi, Chen Feng, Fan Zhang, Xiaozhong Xu, Shan Liu, David Bull

TL;DR
This paper introduces a perceptual loss function based on a neural quality model using Mamba, improving the perceptual quality of user-generated content transcoding in neural video codecs.
Contribution
It proposes a novel perceptual loss function utilizing a Mamba-based neural quality model, shifting the reference from pixel accuracy to perceptual quality in UGC video transcoding.
Findings
Achieves 8.46% BD-rate savings over autoencoder baselines.
Achieves 12.89% BD-rate savings over implicit neural representation baselines.
Improves perceptual quality in neural video transcoding tasks.
Abstract
In user-generated content (UGC) transcoding, source videos typically suffer various degradations due to prior compression, editing, or suboptimal capture conditions. Consequently, existing video compression paradigms that solely optimize for fidelity relative to the reference become suboptimal, as they force the codec to replicate the inherent artifacts of the non-pristine source. To address this, we propose a novel perceptually inspired loss function for learning-based UGC video transcoding that redefines the role of the reference video, shifting it from a ground-truth pixel anchor to an informative contextual guide. Specifically, we train a lightweight neural quality model based on a Selective Structured State-Space Model (Mamba) optimized using a weakly-supervised Siamese ranking strategy. The proposed model is then integrated into the rate-distortion optimization (RDO) process of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Video Coding and Compression Technologies · Advanced Data Compression Techniques
