TL;DR
InkDiffuser is a diffusion-based framework that significantly improves the realism and fidelity of one-shot Chinese calligraphy synthesis by enhancing high-frequency details and explicitly modeling ink morphology.
Contribution
It introduces a high-frequency enhancement mechanism and a differentiable ink structure loss to improve stroke rendering and ink morphology in calligraphy generation.
Findings
Outperforms existing few-shot font generation methods in structural consistency.
Produces more realistic ink rendering effects from a single reference glyph.
Demonstrates superior detail fidelity and visual authenticity across styles.
Abstract
Current Chinese calligraphy generation methods suffer from poor stroke rendering and unrealistic ink morphology, resulting in outputs with limited visual fidelity and artistic fluidity. To address this problem, we propose \textbf{InkDiffuser}, a diffusion-based generative framework for one-shot Chinese calligraphy synthesis. To guarantee high-fidelity rendering, we introduce two core contributions: a high-frequency enhancement mechanism and a Differentiable Ink Structure (DIS) loss that explicitly regularizes ink morphology. Inspired by the observation that high-frequency information in individual samples typically carries contour details, we enhance content extraction by explicitly fusing high-frequency representations for more accurate font structure. Furthermore, we propose a differentiable ink structure loss that integrates differentiable morphological operations into the diffusion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
