Smaller But Better: Unifying Layout Generation with Smaller Large   Language Models

Peirong Zhang; Jiaxin Zhang; Jiahuan Cao; Hongliang Li; Lianwen Jin

arXiv:2502.14005·cs.LG·February 21, 2025

Smaller But Better: Unifying Layout Generation with Smaller Large Language Models

Peirong Zhang, Jiaxin Zhang, Jiahuan Cao, Hongliang Li, Lianwen Jin

PDF

Open Access 1 Repo

TL;DR

This paper introduces LGGPT, a compact 1.5B parameter language model that unifies layout generation tasks using a novel instruction-response format and encoding strategy, achieving high performance efficiently.

Contribution

The paper presents ALI and ULR as a unified I/O template and IQE for efficient layout encoding, enabling a small LLM to excel in diverse layout generation tasks.

Findings

01

LGGPT outperforms larger models in unified layout generation tasks.

02

The proposed encoding and template improve efficiency and performance.

03

A 1.5B model balances proficiency and efficiency effectively.

Abstract

We propose LGGPT, an LLM-based model tailored for unified layout generation. First, we propose Arbitrary Layout Instruction (ALI) and Universal Layout Response (ULR) as the uniform I/O template. ALI accommodates arbitrary layout generation task inputs across multiple layout domains, enabling LGGPT to unify both task-generic and domain-generic layout generation hitherto unexplored. Collectively, ALI and ULR boast a succinct structure that forgoes superfluous tokens typically found in existing HTML-based formats, facilitating efficient instruction tuning and boosting unified generation performance. In addition, we propose an Interval Quantization Encoding (IQE) strategy that compresses ALI into a more condensed structure. IQE precisely preserves valid layout clues while eliminating the less informative placeholders, facilitating LGGPT to capture complex and variable layout generation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

niceringnode/lggpt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsAdversarially Learned Inference