One-Shot Diffusion Mimicker for Handwritten Text Generation

Gang Dai; Yifan Zhang; Quhui Ke; Qiangya Guo; Shuangping Huang

arXiv:2409.04004·cs.CV·September 12, 2024

One-Shot Diffusion Mimicker for Handwritten Text Generation

Gang Dai, Yifan Zhang, Quhui Ke, Qiangya Guo, Shuangping Huang

PDF

Open Access 1 Repo

TL;DR

This paper introduces One-DM, a diffusion-based model that can generate handwritten text mimicking a specific style from just a single sample, outperforming previous multi-sample methods.

Contribution

The paper presents a novel one-shot style extraction technique using high-frequency features and integrates it into a diffusion model for high-quality handwritten text generation.

Findings

01

Effective style transfer from one sample in multiple languages

02

Outperforms methods requiring over ten samples

03

High-quality, realistic handwritten text generation

Abstract

Existing handwritten text generation methods often require more than ten handwriting samples as style references. However, in practical applications, users tend to prefer a handwriting generation model that operates with just a single reference sample for its convenience and efficiency. This approach, known as "one-shot generation", significantly simplifies the process but poses a significant challenge due to the difficulty of accurately capturing a writer's style from a single sample, especially when extracting fine details from the characters' edges amidst sparse foreground and undesired background noise. To address this problem, we propose a One-shot Diffusion Mimicker (One-DM) to generate handwritten text that can mimic any calligraphic style with only one reference sample. Inspired by the fact that high-frequency information of the individual sample often contains distinct style…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dailenson/one-dm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Human Motion and Animation

MethodsDiffusion