LayoutFormer++: Conditional Graphic Layout Generation via Constraint Serialization and Decoding Space Restriction
Zhaoyun Jiang, Jiaqi Guo, Shizhao Sun, Huayu Deng, Zhongkai Wu, Vuksan, Mijovic, Zijiang James Yang, Jian-Guang Lou, Dongmei Zhang

TL;DR
LayoutFormer++ introduces a novel Transformer-based framework that flexibly handles diverse user constraints in graphic layout generation through constraint serialization and a decoding space restriction strategy, improving quality and adherence.
Contribution
It proposes a constraint serialization scheme and a decoding space restriction method within a Transformer framework for flexible, high-quality conditional layout generation.
Findings
Outperforms existing methods in quality and constraint adherence
Effectively handles diverse user constraints uniformly
Reduces constraint violations in generated layouts
Abstract
Conditional graphic layout generation, which generates realistic layouts according to user constraints, is a challenging task that has not been well-studied yet. First, there is limited discussion about how to handle diverse user constraints flexibly and uniformly. Second, to make the layouts conform to user constraints, existing work often sacrifices generation quality significantly. In this work, we propose LayoutFormer++ to tackle the above problems. First, to flexibly handle diverse constraints, we propose a constraint serialization scheme, which represents different user constraints as sequences of tokens with a predefined format. Then, we formulate conditional layout generation as a sequence-to-sequence transformation, and leverage encoder-decoder framework with Transformer as the basic architecture. Furthermore, to make the layout better meet user requirements without harming…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Advanced Image and Video Retrieval Techniques · Computer Graphics and Visualization Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Absolute Position Encodings · Softmax · Residual Connection · Byte Pair Encoding · Label Smoothing · Position-Wise Feed-Forward Layer
