Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment
Tianwei Zhou, Songbai Tan, Wei Zhou, Yu Luo, Yuan-Gen Wang, and, Guanghui Yue

TL;DR
This paper introduces AMFF-Net, a novel blind AI-generated image quality assessment model that evaluates images based on visual quality, authenticity, and consistency using multi-scale features and adaptive fusion, outperforming existing methods.
Contribution
The paper proposes a new multi-scale feature fusion network with adaptive weighting for blind IQA of AI-generated images, incorporating text-image semantic alignment.
Findings
AMFF-Net outperforms nine state-of-the-art blind IQA methods.
Multi-scale input strategy improves assessment accuracy.
Adaptive feature fusion effectively combines local and global features.
Abstract
With the increasing maturity of the text-to-image and image-to-image generative models, AI-generated images (AGIs) have shown great application potential in advertisement, entertainment, education, social media, etc. Although remarkable advancements have been achieved in generative models, very few efforts have been paid to design relevant quality assessment models. In this paper, we propose a novel blind image quality assessment (IQA) network, named AMFF-Net, for AGIs. AMFF-Net evaluates AGI quality from three dimensions, i.e., "visual quality", "authenticity", and "consistency". Specifically, inspired by the characteristics of the human visual system and motivated by the observation that "visual quality" and "authenticity" are characterized by both local and global aspects, AMFF-Net scales the image up and down and takes the scaled images and original-sized image as the inputs to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Image and Video Quality Assessment · Visual Attention and Saliency Detection
