One-Shot Diffusion Mimicker for Handwritten Text Generation
Gang Dai, Yifan Zhang, Quhui Ke, Qiangya Guo, Shuangping Huang

TL;DR
This paper introduces One-DM, a diffusion-based model that can generate handwritten text mimicking a specific style from just a single sample, outperforming previous multi-sample methods.
Contribution
The paper presents a novel one-shot style extraction technique using high-frequency features and integrates it into a diffusion model for high-quality handwritten text generation.
Findings
Effective style transfer from one sample in multiple languages
Outperforms methods requiring over ten samples
High-quality, realistic handwritten text generation
Abstract
Existing handwritten text generation methods often require more than ten handwriting samples as style references. However, in practical applications, users tend to prefer a handwriting generation model that operates with just a single reference sample for its convenience and efficiency. This approach, known as "one-shot generation", significantly simplifies the process but poses a significant challenge due to the difficulty of accurately capturing a writer's style from a single sample, especially when extracting fine details from the characters' edges amidst sparse foreground and undesired background noise. To address this problem, we propose a One-shot Diffusion Mimicker (One-DM) to generate handwritten text that can mimic any calligraphic style with only one reference sample. Inspired by the fact that high-frequency information of the individual sample often contains distinct style…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Human Motion and Animation
MethodsDiffusion
