DGIQA: Depth-guided Feature Attention and Refinement for Generalizable Image Quality Assessment

Vaishnav Ramesh; Junliang Liu; Haining Wang; and Md Jahidul Islam

arXiv:2505.24002·cs.CV·June 2, 2025

DGIQA: Depth-guided Feature Attention and Refinement for Generalizable Image Quality Assessment

Vaishnav Ramesh, Junliang Liu, Haining Wang, and Md Jahidul Islam

PDF

Open Access 1 Repo

TL;DR

DGIQA introduces a depth-guided attention and feature refinement framework that enhances no-reference image quality assessment, achieving state-of-the-art results and strong generalization to unseen distortions.

Contribution

The paper proposes Depth-CAR and TCB mechanisms to incorporate depth and global context, improving feature discrimination and generalization in NR-IQA.

Findings

01

Achieves SOTA performance on benchmark datasets.

02

Outperforms existing models in cross-dataset evaluations.

03

Effective in assessing natural distortions like low-light and haze.

Abstract

A long-held challenge in no-reference image quality assessment (NR-IQA) learning from human subjective perception is the lack of objective generalization to unseen natural distortions. To address this, we integrate a novel Depth-Guided cross-attention and refinement (Depth-CAR) mechanism, which distills scene depth and spatial features into a structure-aware representation for improved NR-IQA. This brings in the knowledge of object saliency and relative contrast of the scene for more discriminative feature learning. Additionally, we introduce the idea of TCB (Transformer-CNN Bridge) to fuse high-level global contextual dependencies from a transformer backbone with local spatial features captured by a set of hierarchical CNN (convolutional neural network) layers. We implement TCB and Depth-CAR as multimodal attention-based projection functions to select the most informative features,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uf-robopi/dgiqa
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection · Image Processing Techniques and Applications · Medical Image Segmentation Techniques

MethodsSparse Evolutionary Training