MetaWriter: Personalized Handwritten Text Recognition Using Meta-Learned Prompt Tuning
Wenhao Gu, Li Gu, Ching Yee Suen, Yang Wang

TL;DR
MetaWriter introduces a prompt tuning approach with meta-learning for personalized handwritten text recognition, enabling style adaptation with minimal parameter updates and unlabeled data, outperforming prior methods on standard benchmarks.
Contribution
The paper presents a novel prompt tuning framework with meta-learning for efficient personalization in handwritten text recognition, reducing parameter updates and annotation needs.
Findings
Outperforms previous state-of-the-art on RIMES and IAM datasets.
Uses less than 1% of model parameters for personalization.
Achieves 20x fewer parameters compared to existing methods.
Abstract
Recent advancements in handwritten text recognition (HTR) have enabled the effective conversion of handwritten text to digital formats. However, achieving robust recognition across diverse writing styles remains challenging. Traditional HTR methods lack writer-specific personalization at test time due to limitations in model architecture and training strategies. Existing attempts to bridge this gap, through gradient-based meta-learning, still require labeled examples and suffer from parameter-inefficient fine-tuning, leading to substantial computational and memory overhead. To overcome these challenges, we propose an efficient framework that formulates personalization as prompt tuning, incorporating an auxiliary image reconstruction task with a self-supervised loss to guide prompt adaptation with unlabeled test-time examples. To ensure self-supervised loss effectively minimizes text…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Speech Recognition and Synthesis · Image Processing and 3D Reconstruction
