Fine-Tuning a Large Vision-Language Model for Artwork's Scoring and Critique

Zhehan Zhang; Meihua Qian; Li Luo; Siyu Huang; Chaoyi Zhou; Ripon Saha; Xinxin Song

arXiv:2602.13306·cs.CV·February 17, 2026

Fine-Tuning a Large Vision-Language Model for Artwork's Scoring and Critique

Zhehan Zhang, Meihua Qian, Li Luo, Siyu Huang, Chaoyi Zhou, Ripon Saha, Xinxin Song

PDF

Open Access

TL;DR

This paper introduces a fine-tuned vision-language model for automated artwork scoring and critique, achieving high accuracy and providing explanatory feedback, thus advancing scalable creativity assessment methods.

Contribution

It presents a novel multi-task learning framework that combines scoring and critique generation for artwork using a large vision-language model with structured prompts.

Findings

01

Achieves Pearson r > 0.97 in scoring accuracy

02

Provides critiques with semantic similarity to expert feedback

03

Demonstrates scalable automated art assessment

Abstract

Assessing artistic creativity is foundational to creativity research and arts education, yet manual scoring (e.g., Torrance Tests of Creative Thinking) is labor-intensive at scale. Prior machine-learning approaches show promise for visual creativity scoring, but many rely mainly on image features and provide limited or no explanatory feedback. We propose a framework for automated creativity assessment of human paintings by fine-tuning the vision-language model Qwen2-VL-7B with multi-task learning. Our dataset contains 1000 human-created paintings scored on a 1-100 scale and paired with a short human-written description (content or artist explanation). Two expert raters evaluated each work using a five-dimension rubric (originality, color, texture, composition, content) and provided written critiques; we use an 80/20 train-test split. We add a lightweight regression head on the visual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAesthetic Perception and Analysis · Creativity in Education and Neuroscience · Artificial Intelligence in Games