Latent Diffusion for Guided Document Table Generation
Syed Jawwad Haider Hamdani, Saifullah Saifullah, Stefan Agne, Andreas, Dengel, Sheraz Ahmed

TL;DR
This paper presents a novel latent diffusion-based method for generating annotated document table images, improving training data quality for table structure recognition models and achieving performance comparable to state-of-the-art methods.
Contribution
The study introduces a conditioned latent diffusion approach to generate realistic annotated table images, enhancing synthetic data quality for training object detection models.
Findings
Generated data improves YOLOv5 performance on pubtables-1m.
Synthetic images achieve low FID scores, indicating high realism.
Method yields results comparable to state-of-the-art in table recognition.
Abstract
Obtaining annotated table structure data for complex tables is a challenging task due to the inherent diversity and complexity of real-world document layouts. The scarcity of publicly available datasets with comprehensive annotations for intricate table structures hinders the development and evaluation of models designed for such scenarios. This research paper introduces a novel approach for generating annotated images for table structure by leveraging conditioned mask images of rows and columns through the application of latent diffusion models. The proposed method aims to enhance the quality of synthetic data used for training object detection models. Specifically, the study employs a conditioning mechanism to guide the generation of complex document table images, ensuring a realistic representation of table layouts. To evaluate the effectiveness of the generated data, we employ the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Data Mining and Analysis · Semantic Web and Ontologies · Recommender Systems and Techniques
MethodsDiffusion
