BeHGAN: Bengali Handwritten Word Generation from Plain Text Using Generative Adversarial Networks
Md. Rakibul Islam, Md. Kamrozzaman Bhuiyan, Safwan Muntasir, Arifur Rahman Jawad, Most. Sharmin Sultana Samu

TL;DR
This paper introduces BeHGAN, a generative adversarial network model designed to produce diverse Bengali handwritten words from plain text, addressing the scarcity of datasets and advancing Bengali handwriting synthesis.
Contribution
The paper presents a novel GAN-based method for Bengali handwritten word generation and introduces a new dataset with samples from 500 individuals.
Findings
Successfully generated diverse Bengali handwritten words
Demonstrated the model's ability to produce realistic handwriting
Contributed a new dataset for Bengali handwriting research
Abstract
Handwritten Text Recognition (HTR) is a well-established research area. In contrast, Handwritten Text Generation (HTG) is an emerging field with significant potential. This task is challenging due to the variation in individual handwriting styles. A large and diverse dataset is required to generate realistic handwritten text. However, such datasets are difficult to collect and are not readily available. Bengali is the fifth most spoken language in the world. While several studies exist for languages such as English and Arabic, Bengali handwritten text generation has received little attention. To address this gap, we propose a method for generating Bengali handwritten words. We developed and used a self-collected dataset of Bengali handwriting samples. The dataset includes contributions from approximately five hundred individuals across different ages and genders. All images were…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Generative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications
