Automatic Training Data Synthesis for Handwriting Recognition Using the Structural Crossing-Over Technique
Sirisak Visessenee, Sanparith Marukatat, and Rachada Kongkachandra

TL;DR
This paper introduces the Structural Crossing-Over technique for generating diverse synthetic handwriting data, significantly improving recognition accuracy over existing methods by creating more varied training samples.
Contribution
The paper proposes a novel data synthesis method that combines structural crossing-over of character pairs to enhance handwriting recognition training datasets.
Findings
Achieved 8.06% recognition error rate using the synthesized data.
Outperformed tangent-based affine transformation and original MNIST data.
Generated more diverse training characters, improving recognition accuracy.
Abstract
The paper presents a novel technique called "Structural Crossing-Over" to synthesize qualified data for training machine learning-based handwriting recognition. The proposed technique can provide a greater variety of patterns of training data than the existing approaches such as elastic distortion and tangent-based affine transformation. A couple of training characters are chosen, then they are analyzed by their similar and different structures, and finally are crossed over to generate the new characters. The experiments are set to compare the performances of tangent-based affine transformation and the proposed approach in terms of the variety of generated characters and percent of recognition errors. The standard MNIST corpus including 60,000 training characters and 10,000 test characters is employed in the experiments. The proposed technique uses 1,000 characters to synthesize 60,000…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Vehicle License Plate Recognition
