UIT-HWDB: Using Transferring Method to Construct A Novel Benchmark for Evaluating Unconstrained Handwriting Image Recognition in Vietnamese
Nghia Hieu Nguyen, Duong T.D. Vo, Kiet Van Nguyen

TL;DR
This paper introduces a novel transferring method to create a high-quality synthetic Vietnamese handwriting dataset, enabling more effective evaluation of recognition methods amid language-specific complexities.
Contribution
It proposes a transferring approach to generate a complex Vietnamese handwriting dataset, addressing resource scarcity and evaluation challenges in offline handwriting recognition.
Findings
The synthetic dataset effectively captures natural handwriting attributes.
State-of-the-art methods face significant challenges on the new dataset.
The dataset facilitates more realistic evaluation of recognition techniques.
Abstract
Recognizing handwriting images is challenging due to the vast variation in writing style across many people and the distinct linguistic aspects of written languages. In Vietnamese, besides the modern Latin characters, there are accent and letter marks, together with characters that confuse state-of-the-art handwriting recognition methods. Moreover, because Vietnamese is a low-resource language, few datasets exist for researching handwriting recognition in it, which raises a barrier for researchers approaching the task. Recent works evaluated offline handwriting recognition methods in Vietnamese using images from an online handwriting dataset, constructed by connecting pen stroke coordinates without further processing. This approach cannot measure the ability of recognition methods effectively, as it is trivial and may lack features…
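To make concrete the baseline the abstract criticizes, here is a minimal sketch of how an offline image can be produced from online data by connecting consecutive pen coordinates with straight segments and nothing more. It is not the paper's transferring method. The names `Stroke`, `draw_line`, and `render_strokes` are hypothetical, and the sketch assumes integer pixel coordinates that already fit inside the canvas.

```python
from typing import List, Tuple

Point = Tuple[int, int]   # (x, y) pixel coordinate; assumed to lie inside the canvas
Stroke = List[Point]      # one pen-down to pen-up trajectory (hypothetical type name)


def draw_line(canvas: List[List[int]], p0: Point, p1: Point) -> None:
    """Mark the pixels on the segment p0 -> p1 using Bresenham's line algorithm."""
    x0, y0 = p0
    x1, y1 = p1
    dx, dy = abs(x1 - x0), -abs(y1 - y0)
    sx = 1 if x0 < x1 else -1
    sy = 1 if y0 < y1 else -1
    err = dx + dy
    while True:
        canvas[y0][x0] = 1
        if x0 == x1 and y0 == y1:
            break
        e2 = 2 * err
        if e2 >= dy:      # step along x
            err += dy
            x0 += sx
        if e2 <= dx:      # step along y
            err += dx
            y0 += sy


def render_strokes(strokes: List[Stroke], width: int, height: int) -> List[List[int]]:
    """Rasterize online strokes into a binary image (1 = ink, 0 = background)."""
    canvas = [[0] * width for _ in range(height)]
    for stroke in strokes:
        if len(stroke) == 1:          # an isolated dot, e.g. a dot-below tone mark
            x, y = stroke[0]
            canvas[y][x] = 1
        # Connect each sampled point to the next one; no pen-up segments are drawn.
        for p0, p1 in zip(stroke, stroke[1:]):
            draw_line(canvas, p0, p1)
    return canvas


if __name__ == "__main__":
    # Two toy strokes: a horizontal bar and a vertical bar.
    image = render_strokes([[(0, 0), (3, 0)], [(1, 1), (1, 3)]], width=4, height=4)
    for row in image:
        print("".join("#" if px else "." for px in row))
```

Every image rendered this way has uniform one-pixel ink on a clean background, with no variation in stroke width, texture, or noise. That uniformity is one way to read the abstract's remark that such images are trivial and may lack features.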
Taxonomy
Topics: Handwritten Text Recognition Techniques · Hand Gesture Recognition Systems · Natural Language Processing Techniques
