A Novel Image Similarity Metric for Scene Composition Structure
Md Redwanul Haque, Manzur Murshed, Manoranjan Paul, Tsz-Kwan Lee

TL;DR
This paper introduces SCSSIM, a new analytical metric that accurately measures the preservation of scene composition structure in images, addressing limitations of existing methods and aiding generative AI evaluation.
Contribution
The paper presents SCSSIM, a training-free, statistically-based metric that effectively assesses structural fidelity of scene composition in images, outperforming traditional and neural network-based metrics.
Findings
SCSSIM is invariant to non-structural distortions.
It accurately detects changes in scene composition structure.
Outperforms existing metrics in structural evaluation tasks.
Abstract
The rapid advancement of generative AI models necessitates novel methods for evaluating image quality that extend beyond human perception. A critical concern for these models is the preservation of an image's underlying Scene Composition Structure (SCS), which defines the geometric relationships among objects and the background, their relative positions, sizes, orientations, etc. Maintaining SCS integrity is paramount for ensuring faithful and structurally accurate GenAI outputs. Traditional image similarity metrics often fall short in assessing SCS. Pixel-level approaches are overly sensitive to minor visual noise, while perception-based metrics prioritize human aesthetic appeal, neither adequately capturing structural fidelity. Furthermore, recent neural-network-based metrics introduce training overheads and potential generalization issues. We introduce the SCS Similarity Index…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
