READ: Recursive Autoencoders for Document Layout Generation

Akshay Gadi Patil; Omri Ben-Eliezer; Or Perel; Hadar Averbuch-Elor

arXiv:1909.00302·cs.CV·April 20, 2020

READ: Recursive Autoencoders for Document Layout Generation

Akshay Gadi Patil, Omri Ben-Eliezer, Or Perel, Hadar Averbuch-Elor

PDF

TL;DR

READ introduces a recursive autoencoder framework that generates diverse, realistic document layouts by learning structural hierarchies, enabling large-scale layout synthesis and improving detection tasks through data augmentation.

Contribution

The paper presents a novel recursive autoencoder approach for generating plausible document layouts from learned structural hierarchies, enhancing diversity and realism.

Findings

01

Generated layouts are highly variable and realistic.

02

Layout generation improves detection performance when used for data augmentation.

03

The method effectively captures structural similarities among document layouts.

Abstract

Layout is a fundamental component of any graphic design. Creating large varieties of plausible document layouts can be a tedious task, requiring numerous constraints to be satisfied, including local ones relating different semantic elements and global constraints on the general appearance and spacing. In this paper, we present a novel framework, coined READ, for REcursive Autoencoders for Document layout generation, to generate plausible 2D layouts of documents in large quantities and varieties. First, we devise an exploratory recursive method to extract a structural decomposition of a single document. Leveraging a dataset of documents annotated with labeled bounding boxes, our recursive neural network learns to map the structural representation, given in the form of a simple hierarchy, to a compact code, the space of which is approximated by a Gaussian distribution. Novel hierarchies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.