MathNet: A Data-Centric Approach for Printed Mathematical Expression   Recognition

Felix M. Schmitt-Koopmann; Elaine M. Huang; Hans-Peter Hutter; Thilo; Stadelmann; Alireza Darvishy

arXiv:2404.13667·cs.CV·June 10, 2024

MathNet: A Data-Centric Approach for Printed Mathematical Expression Recognition

Felix M. Schmitt-Koopmann, Elaine M. Huang, Hans-Peter Hutter, Thilo, Stadelmann, Alireza Darvishy

PDF

1 Repo

TL;DR

MathNet introduces a data-centric approach with enhanced LaTeX normalization and diverse datasets, significantly improving printed mathematical expression recognition accuracy across multiple benchmarks.

Contribution

The paper presents a novel LaTeX normalization method, an expanded dataset with multiple fonts, and a new MER model, MathNet, achieving state-of-the-art results.

Findings

01

MathNet outperforms previous models by up to 88.3% on test sets.

02

Enhanced LaTeX normalization reduces ground truth variability.

03

Diverse font dataset improves model generalization.

Abstract

Printed mathematical expression recognition (MER) models are usually trained and tested using LaTeX-generated mathematical expressions (MEs) as input and the LaTeX source code as ground truth. As the same ME can be generated by various different LaTeX source codes, this leads to unwanted variations in the ground truth data that bias test performance results and hinder efficient learning. In addition, the use of only one font to generate the MEs heavily limits the generalization of the reported results to realistic scenarios. We propose a data-centric approach to overcome this problem, and present convincing experimental results: Our main contribution is an enhanced LaTeX normalization to map any LaTeX ME to a canonical form. Based on this process, we developed an improved version of the benchmark dataset im2latex-100k, featuring 30 fonts instead of one. Second, we introduce the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

felix-schmitt/MathNet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.