Pattern Generation Strategies for Improving Recognition of Handwritten Mathematical Expressions
Anh Duc Le, Bipin Indurkhya, Masaki Nakagawa

TL;DR
This paper introduces pattern generation strategies that create shape and structural variations in handwritten mathematical expressions to enhance recognition accuracy, especially when training data is limited.
Contribution
It proposes novel data augmentation techniques combining distortions and decomposition to improve recognition systems for handwritten math expressions.
Findings
Hybrid strategy outperforms individual methods in recognition accuracy.
Achieved classification rates of 48.78% and 45.60% on CROHME 2014 and 2016.
Generated datasets are publicly available for further research.
Abstract
Recognition of Handwritten Mathematical Expressions (HMEs) is a challenging problem because of the ambiguity and complexity of two-dimensional handwriting. Moreover, the lack of large training data is a serious issue, especially for academic recognition systems. In this paper, we propose pattern generation strategies that generate shape and structural variations to improve the performance of recognition systems based on a small training set. For data generation, we employ the public databases: CROHME 2014 and 2016 of online HMEs. The first strategy employs local and global distortions to generate shape variations. The second strategy decomposes an online HME into sub-online HMEs to get more structural variations. The hybrid strategy combines both these strategies to maximize shape and structural variations. The generated online HMEs are converted to images for offline HME recognition.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Multimodal Machine Learning Applications
