Synthetic Document Generator for Annotation-free Layout Recognition

Natraj Raman; Sameena Shah; Manuela Veloso

arXiv:2111.06016·cs.CV·July 26, 2022

Synthetic Document Generator for Annotation-free Layout Recognition

Natraj Raman, Sameena Shah, Manuela Veloso

PDF

TL;DR

This paper introduces a synthetic document generator that creates realistic, labeled document images for training layout recognition models, reducing the need for costly manual annotations.

Contribution

It presents a novel hierarchical generative process using Bayesian Networks and stochastic templates to produce diverse, realistic synthetic documents for layout detection training.

Findings

01

Synthetic documents enable training models that perform comparably to those trained on real data.

02

The generative approach captures complex layout variations effectively.

03

The method reduces dependency on expensive manual annotations.

Abstract

Analyzing the layout of a document to identify headers, sections, tables, figures etc. is critical to understanding its content. Deep learning based approaches for detecting the layout structure of document images have been promising. However, these methods require a large number of annotated examples during training, which are both expensive and time consuming to obtain. We describe here a synthetic document generator that automatically produces realistic documents with labels for spatial positions, extents and categories of the layout elements. The proposed generative process treats every physical component of a document as a random variable and models their intrinsic dependencies using a Bayesian Network graph. Our hierarchical formulation using stochastic templates allow parameter sharing between documents for retaining broad themes and yet the distributional characteristics…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.