Text-Conditioned Background Generation for Editable Multi-Layer Documents
Taewon Kang, Joseph K J, Chris Tensmeyer, Jihyung Kil, Wanrong Zhu, Ming C. Lin, Vlad I. Morariu

TL;DR
This paper introduces a training-free framework for generating and editing multi-page document backgrounds with maintained readability, thematic consistency, and user-driven stylistic control, using novel diffusion-based and optimization techniques.
Contribution
It proposes a new latent masking approach and Automated Readability Optimization for coherent, editable multi-page document background generation with minimal user intervention.
Findings
Ensures text readability through automated opacity adjustments.
Maintains multi-page thematic consistency via summarization and instruction.
Enables targeted background editing without affecting text integrity.
Abstract
We present a framework for document-centric background generation with multi-page editing and thematic continuity. To ensure text regions remain readable, we employ a \emph{latent masking} formulation that softly attenuates updates in the diffusion space, inspired by smooth barrier functions in physics and numerical optimization. In addition, we introduce \emph{Automated Readability Optimization (ARO)}, which automatically places semi-transparent, rounded backing shapes behind text regions. ARO determines the minimal opacity needed to satisfy perceptual contrast standards (WCAG 2.2) relative to the underlying background, ensuring readability while maintaining aesthetic harmony without human intervention. Multi-page consistency is maintained through a summarization-and-instruction process, where each page is distilled into a compact representation that recursively guides subsequent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Humanities and Scholarship · Generative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques
