DocSynthv2: A Practical Autoregressive Modeling for Document Generation
Sanket Biswas, Rajiv Jain, Vlad I. Morariu, Jiuxiang Gu, Puneet, Mathur, Curtis Wigington, Tong Sun, Josep Llad\'os

TL;DR
DocSynthv2 introduces an autoregressive model that jointly generates document layout and content, advancing the field of comprehensive document creation without visual cues, and demonstrating improved quality and relevance in generated documents.
Contribution
The paper presents a novel autoregressive model that integrates layout and textual cues for comprehensive document generation, surpassing existing layout-only methods.
Findings
Enhanced document generation quality and relevance
Effective handling of layout and textual content integration
Demonstrated superiority on a curated benchmark
Abstract
While the generation of document layouts has been extensively explored, comprehensive document generation encompassing both layout and content presents a more complex challenge. This paper delves into this advanced domain, proposing a novel approach called DocSynthv2 through the development of a simple yet effective autoregressive structured model. Our model, distinct in its integration of both layout and textual cues, marks a step beyond existing layout-generation approaches. By focusing on the relationship between the structural elements and the textual content within documents, we aim to generate cohesive and contextually relevant documents without any reliance on visual components. Through experimental studies on our curated benchmark for the new task, we demonstrate the ability of our model combining layout and textual information in enhancing the generation quality and relevance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Advanced Text Analysis Techniques
