Weakly Supervised Scene Text Generation for Low-resource Languages
Yangchen Xie, Xinyuan Chen, Hongjian Zhan, Palaiahankote Shivakum,, Bing Yin, Cong Liu, Yue Lu

TL;DR
This paper introduces a weakly supervised scene text generation approach for low-resource languages, enabling the creation of diverse training images with minimal labels to improve recognition models.
Contribution
It presents a novel method that leverages few recognition labels, disentangles content and style, and uses an attention module and font classifier to generate diverse, high-quality scene text images.
Findings
Generated data improves recognition accuracy
Method outperforms existing generative approaches
Enhances low-resource language recognition models
Abstract
A large number of annotated training images is crucial for training successful scene text recognition models. However, collecting sufficient datasets can be a labor-intensive and costly process, particularly for low-resource languages. To address this challenge, auto-generating text data has shown promise in alleviating the problem. Unfortunately, existing scene text generation methods typically rely on a large amount of paired data, which is difficult to obtain for low-resource languages. In this paper, we propose a novel weakly supervised scene text generation method that leverages a few recognition-level labels as weak supervision. The proposed method is able to generate a large amount of scene text images with diverse backgrounds and font styles through cross-language generation. Our method disentangles the content and style features of scene text images, with the former…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
