Artistic Intelligence: A Diffusion-Based Framework for High-Fidelity Landscape Painting Synthesis
Wanggong Yang, Yifei Zhao

TL;DR
This paper introduces LPGen, a diffusion-based model for high-fidelity landscape painting synthesis that uses decoupled attention and structural control to produce detailed, stylistically coherent artworks.
Contribution
It presents a novel diffusion framework with decoupled cross-attention and a structural controller, advancing AI-generated landscape art with improved fidelity and style control.
Findings
LPGen outperforms existing models in structural accuracy.
The model produces stylistically coherent landscape paintings.
Extensive evaluations validate the effectiveness of the approach.
Abstract
Generating high-fidelity landscape paintings remains a challenging task that requires precise control over both structure and style. In this paper, we present LPGen, a novel diffusion-based model specifically designed for landscape painting generation. LPGen introduces a decoupled cross-attention mechanism that independently processes structural and stylistic features, effectively mimicking the layered approach of traditional painting techniques. Additionally, LPGen proposes a structural controller, a multi-scale encoder designed to control the layout of landscape paintings, striking a balance between aesthetics and composition. Besides, the model is pre-trained on a curated dataset of high-resolution landscape images, categorized by distinct artistic styles, and then fine-tuned to ensure detailed and consistent output. Through extensive evaluations, LPGen demonstrates superior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAesthetic Perception and Analysis · Architecture and Art History Studies · Color perception and design
MethodsLatent Diffusion Model · Diffusion
