E-comIQ-ZH: A Human-Aligned Dataset and Benchmark for Fine-Grained Evaluation of E-commerce Posters with Chain-of-Thought
Meiqi Sun, Mingyu Li, Junxiong Zhu

TL;DR
This paper introduces E-comIQ-ZH, a comprehensive dataset and benchmark for evaluating Chinese e-commerce posters, addressing the gap in automated quality assessment for complex Chinese characters and functional design criteria.
Contribution
It presents the first multi-dimensional dataset with expert rationales, a specialized evaluation model, and an automated benchmark for Chinese e-commerce poster quality assessment.
Findings
E-comIQ-M aligns closely with human expert judgments.
The benchmark enables scalable automated evaluation.
The dataset facilitates future research in Chinese e-commerce design assessment.
Abstract
Generative AI is widely used to create commercial posters. However, rapid advances in generation have outpaced automated quality assessment. Existing models emphasize generic esthetics or low level distortions and lack the functional criteria required for e-commerce design. It is especially challenging for Chinese content, where complex characters often produce subtle but critical textual artifacts that are overlooked by existing methods. To address this, we introduce E-comIQ-ZH, a framework for evaluating Chinese e-commerce posters. We build the first dataset E-comIQ-18k to feature multi dimensional scores and expert calibrated Chain of Thought (CoT) rationales. Using this dataset, we train E-comIQ-M, a specialized evaluation model that aligns with human expert judgment. Our framework enables E-comIQ-Bench, the first automated and scalable benchmark for the generation of Chinese…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Generative Adversarial Networks and Image Synthesis
