VisualCritic: Making LMMs Perceive Visual Quality Like Humans
Zhipeng Huang, Zhizheng Zhang, Yiting Lu, Zheng-Jun Zha, Zhibo Chen,, Baining Guo

TL;DR
VisualCritic is a novel large multimodal model capable of perceiving and assessing visual quality in images similarly to humans, covering diverse metrics and providing explainable feedback without dataset-specific tuning.
Contribution
This paper introduces VisualCritic, the first LMM designed for broad-spectrum image quality assessment, capable of quantitative and qualitative evaluation without dataset-specific adaptation.
Findings
VisualCritic accurately predicts Mean Opinion Scores (MOS) for images.
It can distinguish between AI-generated and photographic images.
Outperforms existing models in cross-dataset visual quality assessment.
Abstract
At present, large multimodal models (LMMs) have exhibited impressive generalization capabilities in understanding and generating visual signals. However, they currently still lack sufficient capability to perceive low-level visual quality akin to human perception. Can LMMs achieve this and show the same degree of generalization in this regard? If so, not only could the versatility of LMMs be further enhanced, but also the challenge of poor cross-dataset performance in the field of visual quality assessment could be addressed. In this paper, we explore this question and provide the answer "Yes!". As the result of this initial exploration, we present VisualCritic, the first LMM for broad-spectrum image subjective quality assessment. VisualCritic can be used across diverse data right out of box, without any requirements of dataset-specific adaptation operations like conventional specialist…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Visualization and Analytics · Digital Media and Visual Art · Color perception and design
