DesignSense: A Human Preference Dataset and Reward Modeling Framework for Graphic Layout Generation
Varun Gopal, Rishabh Jain, Aradhya Mathur, Nikitha SR, Sohan Patnaik, Sudhir Yarram, Mayur Hemani, Balaji Krishnamurthy, Mausoom Sarkar

TL;DR
This paper introduces DesignSense, a large-scale human preference dataset and a reward modeling framework for graphic layout generation, addressing the gap in aligning models with nuanced human aesthetic judgments.
Contribution
The paper presents a novel dataset, DesignSense-10k, and a specialized reward model that improves layout evaluation and generation quality over existing models.
Findings
DesignSense-10k contains 10,235 human-annotated preference pairs.
The reward model outperforms existing models with a 54.6% improvement in Macro F1.
Using the model in RL training improves layout generator win rate by 3%.
Abstract
Graphic layouts serve as an important and engaging medium for visual communication across different channels. While recent layout generation models have demonstrated impressive capabilities, they frequently fail to align with nuanced human aesthetic judgment. Existing preference datasets and reward models trained on text-to-image generation do not generalize to layout evaluation, where the spatial arrangement of identical elements determines quality. To address this critical gap, we introduce DesignSense-10k, a large-scale dataset of 10,235 human-annotated preference pairs for graphic layout evaluation. We propose a five-stage curation pipeline that generates visually coherent layout transformations across diverse aspect ratios, using semantic grouping, layout prediction, filtering, clustering, and VLM-based refinement to produce high-quality comparison pairs. Human preferences are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · 3D Shape Modeling and Analysis
