PosterO: Structuring Layout Trees to Enable Language Models in Generalized Content-Aware Layout Generation

HsiaoYuan Hsu; Yuxin Peng

arXiv:2505.07843·cs.GR·May 28, 2025

PosterO: Structuring Layout Trees to Enable Language Models in Generalized Content-Aware Layout Generation

HsiaoYuan Hsu, Yuxin Peng

PDF

TL;DR

PosterO introduces a novel layout-centric method leveraging large language models to generate diverse, content-aware poster layouts by structuring design elements as trees, achieving state-of-the-art results and supporting generalized design scenarios.

Contribution

It proposes a new approach that structures layout trees in SVG language and uses LLMs for generalized, content-aware poster generation, addressing limitations of prior image-centric methods.

Findings

01

Achieves state-of-the-art performance on multiple benchmarks.

02

Successfully generates diverse, visually appealing poster layouts.

03

Demonstrates effectiveness in generalized, multi-purpose poster design scenarios.

Abstract

In poster design, content-aware layout generation is crucial for automatically arranging visual-textual elements on the given image. With limited training data, existing work focused on image-centric enhancement. However, this neglects the diversity of layouts and fails to cope with shape-variant elements or diverse design intents in generalized settings. To this end, we proposed a layout-centric approach that leverages layout knowledge implicit in large language models (LLMs) to create posters for omnifarious purposes, hence the name PosterO. Specifically, it structures layouts from datasets as trees in SVG language by universal shape, design intent vectorization, and hierarchical node representation. Then, it applies LLMs during inference to predict new layout trees by in-context learning with intent-aligned example selection. After layout trees are generated, we can seamlessly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.