SHAMISA: SHAped Modeling of Implicit Structural Associations for Self-supervised No-Reference Image Quality Assessment
Mahdi Naseri, Zhou Wang

TL;DR
SHAMISA introduces a self-supervised, relation-based framework for no-reference image quality assessment that leverages structured relational supervision and synthetic distortions to improve generalization without human labels.
Contribution
It proposes a novel implicit structural association approach with a compositional distortion engine, enhancing NR-IQA performance and robustness without relying on human perceptual labels.
Findings
Achieves strong performance on multiple NR-IQA benchmarks.
Demonstrates improved cross-dataset generalization.
Operates without human quality annotations or contrastive losses.
Abstract
No-Reference Image Quality Assessment (NR-IQA) aims to estimate perceptual quality without access to a reference image of pristine quality. Learning an NR-IQA model faces a fundamental bottleneck: its need for a large number of costly human perceptual labels. We propose SHAMISA, a non-contrastive self-supervised framework that learns from unlabeled distorted images by leveraging explicitly structured relational supervision. Unlike prior methods that impose rigid, binary similarity constraints, SHAMISA introduces implicit structural associations, defined as soft, controllable relations that are both distortion-aware and content-sensitive, inferred from synthetic metadata and intrinsic feature structure. A key innovation is our compositional distortion engine, which generates an uncountable family of degradations from continuous parameter spaces, grouped so that only one distortion factor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Video Quality Assessment · Image Enhancement Techniques · Visual Attention and Saliency Detection
