LayoutFormer++: Conditional Graphic Layout Generation via Constraint   Serialization and Decoding Space Restriction

Zhaoyun Jiang; Jiaqi Guo; Shizhao Sun; Huayu Deng; Zhongkai Wu; Vuksan; Mijovic; Zijiang James Yang; Jian-Guang Lou; Dongmei Zhang

arXiv:2208.08037·cs.CV·March 27, 2023·1 cites

LayoutFormer++: Conditional Graphic Layout Generation via Constraint Serialization and Decoding Space Restriction

Zhaoyun Jiang, Jiaqi Guo, Shizhao Sun, Huayu Deng, Zhongkai Wu, Vuksan, Mijovic, Zijiang James Yang, Jian-Guang Lou, Dongmei Zhang

PDF

Open Access

TL;DR

LayoutFormer++ introduces a novel Transformer-based framework that flexibly handles diverse user constraints in graphic layout generation through constraint serialization and a decoding space restriction strategy, improving quality and adherence.

Contribution

It proposes a constraint serialization scheme and a decoding space restriction method within a Transformer framework for flexible, high-quality conditional layout generation.

Findings

01

Outperforms existing methods in quality and constraint adherence

02

Effectively handles diverse user constraints uniformly

03

Reduces constraint violations in generated layouts

Abstract

Conditional graphic layout generation, which generates realistic layouts according to user constraints, is a challenging task that has not been well-studied yet. First, there is limited discussion about how to handle diverse user constraints flexibly and uniformly. Second, to make the layouts conform to user constraints, existing work often sacrifices generation quality significantly. In this work, we propose LayoutFormer++ to tackle the above problems. First, to flexibly handle diverse constraints, we propose a constraint serialization scheme, which represents different user constraints as sequences of tokens with a predefined format. Then, we formulate conditional layout generation as a sequence-to-sequence transformation, and leverage encoder-decoder framework with Transformer as the basic architecture. Furthermore, to make the layout better meet user requirements without harming…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Advanced Image and Video Retrieval Techniques · Computer Graphics and Visualization Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Absolute Position Encodings · Softmax · Residual Connection · Byte Pair Encoding · Label Smoothing · Position-Wise Feed-Forward Layer