TL;DR
This paper introduces MA-AGIQA, a multi-modality model that enhances AI-generated image quality assessment by incorporating semantic understanding through text prompts and a mixture of experts, outperforming existing models.
Contribution
The paper proposes a novel multi-modality approach with semantic guidance and a mixture of experts to improve AGI quality assessment, addressing limitations of traditional DNN-based models.
Findings
Achieves state-of-the-art performance on AIGCQA-20k and AGIQA-3k datasets.
Demonstrates superior generalization in assessing AI-generated images.
Effectively integrates semantic information with quality features.
Abstract
Traditional deep neural network (DNN)-based image quality assessment (IQA) models leverage convolutional neural networks (CNN) or Transformer to learn the quality-aware feature representation, achieving commendable performance on natural scene images. However, when applied to AI-Generated images (AGIs), these DNN-based IQA models exhibit subpar performance. This situation is largely due to the semantic inaccuracies inherent in certain AGIs caused by uncontrollable nature of the generation process. Thus, the capability to discern semantic content becomes crucial for assessing the quality of AGIs. Traditional DNN-based IQA models, constrained by limited parameter complexity and training data, struggle to capture complex fine-grained semantic features, making it challenging to grasp the existence and coherence of semantic content of the entire image. To address the shortfall in semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Dropout · Residual Connection · Softmax · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Absolute Position Encodings · Linear Layer · Dense Connections · Label Smoothing
