IDEA-Bench: How Far are Generative Models from Professional Designing?
Chen Liang, Lianghua Huang, Jingwu Fang, Huanzhang Dou, Wei Wang,, Zhi-Fan Wu, Yupeng Shi, Junge Zhang, Xin Zhao, Yu Liu

TL;DR
IDEA-Bench is a new comprehensive benchmark designed to evaluate the capabilities of generative models in complex professional design tasks, revealing significant gaps in current model performance and guiding future improvements.
Contribution
The paper introduces IDEA-Bench, a large-scale benchmark with 100 real-world design tasks and 275 test cases, along with evaluation tools and a leaderboard to advance generative model development for design applications.
Findings
Current models achieve only 22.48 and 6.81 scores on IDEA-Bench.
Significant challenges remain in applying generative models to professional design tasks.
Benchmark and evaluation tools are released to foster further research.
Abstract
Real-world design tasks - such as picture book creation, film storyboard development using character sets, photo retouching, visual effects, and font transfer - are highly diverse and complex, requiring deep interpretation and extraction of various elements from instructions, descriptions, and reference images. The resulting images often implicitly capture key features from references or user inputs, making it challenging to develop models that can effectively address such varied tasks. While existing visual generative models can produce high-quality images based on prompts, they face significant limitations in professional design scenarios that involve varied forms and multiple inputs and outputs, even when enhanced with adapters like ControlNets and LoRAs. To address this, we introduce IDEA-Bench, a comprehensive benchmark encompassing 100 real-world design tasks, including rendering,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDesign Education and Practice
