Towards Top-Down Stereo Image Quality Assessment via Stereo Attention
Huilin Zhang, Sumei Li, Haoxiang Chang, Peiming Lin

TL;DR
This paper introduces SATNet, a top-down stereo image quality assessment model that uses attention mechanisms and binocular response tuning to improve the accuracy of 3D content quality evaluation.
Contribution
The paper proposes a novel top-down stereo attention network with adaptive attention and energy coefficient mechanisms for enhanced SIQA performance.
Findings
Outperforms existing SIQA methods on benchmark datasets.
Effectively models human visual system characteristics in stereo quality assessment.
Demonstrates the importance of top-down attention in stereo image quality evaluation.
Abstract
Stereo image quality assessment (SIQA) plays a crucial role in evaluating and improving the visual experience of 3D content. Existing visual properties-based methods for SIQA have achieved promising performance. However, these approaches ignore the top-down philosophy, leading to a lack of a comprehensive grasp of the human visual system (HVS) and SIQA. This paper presents a novel Stereo AttenTion Network (SATNet), which employs a top-down perspective to guide the quality assessment process. Specifically, our generalized Stereo AttenTion (SAT) structure adapts components and input/output for stereo scenarios. It leverages the fusion-generated attention map as a higher-level binocular modulator to influence two lower-level monocular features, allowing progressive recalibration of both throughout the pipeline. Additionally, we introduce an Energy Coefficient (EC) to flexibly tune the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual perception and processing mechanisms · Advanced Image Fusion Techniques · Image and Video Quality Assessment
