TL;DR
MathNet introduces a data-centric approach with enhanced LaTeX normalization and diverse datasets, significantly improving printed mathematical expression recognition accuracy across multiple benchmarks.
Contribution
The paper presents a novel LaTeX normalization method, an expanded dataset with multiple fonts, and a new MER model, MathNet, achieving state-of-the-art results.
Findings
MathNet outperforms previous models by up to 88.3% on test sets.
Enhanced LaTeX normalization reduces ground truth variability.
Diverse font dataset improves model generalization.
Abstract
Printed mathematical expression recognition (MER) models are usually trained and tested using LaTeX-generated mathematical expressions (MEs) as input and the LaTeX source code as ground truth. As the same ME can be generated by various different LaTeX source codes, this leads to unwanted variations in the ground truth data that bias test performance results and hinder efficient learning. In addition, the use of only one font to generate the MEs heavily limits the generalization of the reported results to realistic scenarios. We propose a data-centric approach to overcome this problem, and present convincing experimental results: Our main contribution is an enhanced LaTeX normalization to map any LaTeX ME to a canonical form. Based on this process, we developed an improved version of the benchmark dataset im2latex-100k, featuring 30 fonts instead of one. Second, we introduce the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
