AI Outperforms Humans in Personalized Image Aesthetics Assessment via LLM-Based Interviews and Semantic Feature Extraction
Yoshia Abe, Tatsuya Daikoku, Yasuo Kuniyoshi

TL;DR
This paper presents an AI system that uses LLM-based interviews and semantic features to outperform humans in predicting individual image aesthetic preferences.
Contribution
The study introduces an integrated DL-LLM system that actively elicits preferences and predicts aesthetics, outperforming traditional models and human predictions.
Findings
The system outperforms conventional models, humans, and self-re-evaluations.
Prediction error is smaller than within-person variability.
AI better captures individual aesthetic preferences than humans.
Abstract
Accurately predicting individual aesthetic evaluation for images is a fundamental challenge for AI. Various deep learning (DL)-based models have been proposed for this task, training on image evaluation data to extract objective low-level features. However, aesthetic preferences are inherently subjective and individual-dependent. Accurate prediction thus requires the extraction of high-level semantic features of images and the active collection of preference information from the target individual. To address this issue, we focus on the utility of Large Language Models (LLMs) pretrained on vast amounts of textual data, and develop an integrated DL-LLM system. The system actively elicits aesthetic preferences through LLM-based semi-structured interviews and predicts aesthetic evaluation by leveraging both low-level and high-level features. In our experiments, we compare the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
