DeepArt: A Benchmark to Advance Fidelity Research in AI-Generated   Content

Wentao Wang; Xuanyao Huang; Tianyang Wang; Swalpa Kumar Roy

arXiv:2312.10407·cs.CV·December 27, 2023·1 cites

DeepArt: A Benchmark to Advance Fidelity Research in AI-Generated Content

Wentao Wang, Xuanyao Huang, Tianyang Wang, Swalpa Kumar Roy

PDF

Open Access

TL;DR

This paper introduces DeepArt, a benchmark dataset for evaluating the fidelity of AI-generated images by GPT-4, highlighting its limitations and providing a new resource for advancing research in AI-generated content.

Contribution

It provides the first detailed analysis of GPT-4's image synthesis fidelity, revealing its limitations and establishing a new benchmark dataset for future research.

Findings

01

GPT-4 shows notable limitations in image synthesis fidelity.

02

The benchmark dataset enables systematic evaluation of AI-generated images.

03

The study offers both qualitative and quantitative insights into GPT-4's image generation capabilities.

Abstract

This paper explores the image synthesis capabilities of GPT-4, a leading multi-modal large language model. We establish a benchmark for evaluating the fidelity of texture features in images generated by GPT-4, comprising manually painted pictures and their AI-generated counterparts. The contributions of this study are threefold: First, we provide an in-depth analysis of the fidelity of image synthesis features based on GPT-4, marking the first such study on this state-of-the-art model. Second, the quantitative and qualitative experiments fully reveals the limitations of the GPT-4 model in image synthesis. Third, we have compiled a unique benchmark of manual drawings and corresponding GPT-4-generated images, introducing a new task to advance fidelity research in AI-generated content (AIGC). The dataset is available at: \url{https://github.com/rickwang28574/DeepArt}.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Topic Modeling

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Label Smoothing · Adam · Layer Normalization · Residual Connection