SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Zewen Chen, Juan Wang, Wen Wang, Sunhan Xu, Hang Xiong, Yun Zeng, Jian, Guo, Shuxun Wang, Chunfeng Yuan, Bing Li, Weiming Hu

TL;DR
SEAGULL is a novel vision-language based network for fine-grained, region-specific image quality assessment that leverages large models, specialized datasets, and mask-based features to improve ROI quality analysis.
Contribution
The paper introduces SEAGULL, a new model combining vision-language guidance and mask-based features for ROI IQA, along with two new datasets for training and evaluation.
Findings
SEAGULL outperforms existing methods in ROI quality assessment.
Pre-training on SEAGULL-100w enhances regional quality perception.
Fine-tuning on SEAGULL-3k improves real-world distortion detection.
Abstract
Existing Image Quality Assessment (IQA) methods achieve remarkable success in analyzing quality for overall image, but few works explore quality analysis for Regions of Interest (ROIs). The quality analysis of ROIs can provide fine-grained guidance for image quality improvement and is crucial for scenarios focusing on region-level quality. This paper proposes a novel network, SEAGULL, which can SEe and Assess ROIs quality with GUidance from a Large vision-Language model. SEAGULL incorporates a vision-language model (VLM), masks generated by Segment Anything Model (SAM) to specify ROIs, and a meticulously designed Mask-based Feature Extractor (MFE) to extract global and local tokens for specified ROIs, enabling accurate fine-grained IQA for ROIs. Moreover, this paper constructs two ROI-based IQA datasets, SEAGULL-100w and SEAGULL-3k, for training and evaluating ROI-based IQA.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Infrared Target Detection Methodologies · Image and Video Quality Assessment
