Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image   Quality Assessment

Tianwei Zhou; Songbai Tan; Wei Zhou; Yu Luo; Yuan-Gen Wang; and; Guanghui Yue

arXiv:2404.15163·cs.CV·April 24, 2024

Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment

Tianwei Zhou, Songbai Tan, Wei Zhou, Yu Luo, Yuan-Gen Wang, and, Guanghui Yue

PDF

Open Access

TL;DR

This paper introduces AMFF-Net, a novel blind AI-generated image quality assessment model that evaluates images based on visual quality, authenticity, and consistency using multi-scale features and adaptive fusion, outperforming existing methods.

Contribution

The paper proposes a new multi-scale feature fusion network with adaptive weighting for blind IQA of AI-generated images, incorporating text-image semantic alignment.

Findings

01

AMFF-Net outperforms nine state-of-the-art blind IQA methods.

02

Multi-scale input strategy improves assessment accuracy.

03

Adaptive feature fusion effectively combines local and global features.

Abstract

With the increasing maturity of the text-to-image and image-to-image generative models, AI-generated images (AGIs) have shown great application potential in advertisement, entertainment, education, social media, etc. Although remarkable advancements have been achieved in generative models, very few efforts have been paid to design relevant quality assessment models. In this paper, we propose a novel blind image quality assessment (IQA) network, named AMFF-Net, for AGIs. AMFF-Net evaluates AGI quality from three dimensions, i.e., "visual quality", "authenticity", and "consistency". Specifically, inspired by the characteristics of the human visual system and motivated by the observation that "visual quality" and "authenticity" are characterized by both local and global aspects, AMFF-Net scales the image up and down and takes the scaled images and original-sized image as the inputs to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Fusion Techniques · Image and Video Quality Assessment · Visual Attention and Saliency Detection