SHAMISA: SHAped Modeling of Implicit Structural Associations for Self-supervised No-Reference Image Quality Assessment

Mahdi Naseri; Zhou Wang

arXiv:2603.13669·cs.CV·March 18, 2026

SHAMISA: SHAped Modeling of Implicit Structural Associations for Self-supervised No-Reference Image Quality Assessment

Mahdi Naseri, Zhou Wang

PDF

Open Access

TL;DR

SHAMISA introduces a self-supervised, relation-based framework for no-reference image quality assessment that leverages structured relational supervision and synthetic distortions to improve generalization without human labels.

Contribution

It proposes a novel implicit structural association approach with a compositional distortion engine, enhancing NR-IQA performance and robustness without relying on human perceptual labels.

Findings

01

Achieves strong performance on multiple NR-IQA benchmarks.

02

Demonstrates improved cross-dataset generalization.

03

Operates without human quality annotations or contrastive losses.

Abstract

No-Reference Image Quality Assessment (NR-IQA) aims to estimate perceptual quality without access to a reference image of pristine quality. Learning an NR-IQA model faces a fundamental bottleneck: its need for a large number of costly human perceptual labels. We propose SHAMISA, a non-contrastive self-supervised framework that learns from unlabeled distorted images by leveraging explicitly structured relational supervision. Unlike prior methods that impose rigid, binary similarity constraints, SHAMISA introduces implicit structural associations, defined as soft, controllable relations that are both distortion-aware and content-sensitive, inferred from synthetic metadata and intrinsic feature structure. A key innovation is our compositional distortion engine, which generates an uncountable family of degradations from continuous parameter spaces, grouped so that only one distortion factor…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Video Quality Assessment · Image Enhancement Techniques · Visual Attention and Saliency Detection