SemiHMER: Semi-supervised Handwritten Mathematical Expression Recognition using pseudo-labels
Kehua Chen, Haoyang Shen

TL;DR
SemiHMER introduces a semi-supervised learning framework with consistency regularization and pseudo-labels to improve handwritten mathematical expression recognition, leveraging unlabeled data and a novel counting module.
Contribution
The paper proposes a novel semi-supervised framework with a dual-branch consistency regularization and a global dynamic counting module for improved HMER accuracy.
Findings
Achieved up to 5.47% accuracy improvement on CROHME datasets.
Effectively leverages unlabeled data with pseudo-labels.
Reduces recognition errors in long-distance formulas.
Abstract
In this paper, we study semi-supervised Handwritten Mathematical Expression Recognition (HMER) via exploring both labeled data and extra unlabeled data. We propose a novel consistency regularization framework, termed SemiHMER, which introduces dual-branch semi-supervised learning. Specifically, we enforce consistency between the two networks for the same input image. The pseudo-label, generated by one perturbed recognition network, is utilized to supervise the other network using the standard cross-entropy loss. The SemiHMER consistency encourages high similarity between the predictions of the two perturbed networks for the same input image and expands the training data by leveraging unlabeled data with pseudo-labels. We further introduce a weak-to-strong strategy by applying different levels of augmentation to each branch, effectively expanding the training data and enhancing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Vehicle License Plate Recognition
