Improving the Representation and Conversion of Mathematical Formulae by   Considering their Textual Context

Moritz Schubotz; Andre Greiner-Petter; Philipp Scharpf; Norman; Meuschke; Howard Cohl; Bela Gipp

arXiv:1804.04956·cs.DL·April 16, 2018

Improving the Representation and Conversion of Mathematical Formulae by Considering their Textual Context

Moritz Schubotz, Andre Greiner-Petter, Philipp Scharpf, Norman, Meuschke, Howard Cohl, Bela Gipp

PDF

1 Repo

TL;DR

This paper introduces a benchmark dataset and a new method that leverages textual context to improve the accuracy of mathematical formula format conversions, aiding semantic understanding and retrieval.

Contribution

It provides an open benchmark dataset, evaluates existing tools, and proposes a context-aware approach to enhance mathematical format conversion accuracy.

Findings

01

Context-aware conversion reduces error rates

02

Benchmark dataset enables future research

03

Linked components facilitate semantic formula understanding

Abstract

Mathematical formulae represent complex semantic information in a concise form. Especially in Science, Technology, Engineering, and Mathematics, mathematical formulae are crucial to communicate information, e.g., in scientific papers, and to perform computations using computer algebra systems. Enabling computers to access the information encoded in mathematical formulae requires machine-readable formats that can represent both the presentation and content, i.e., the semantics, of formulae. Exchanging such information between systems additionally requires conversion methods for mathematical representation formats. We analyze how the semantic enrichment of formulae improves the format conversion process and show that considering the textual context of formulae reduces the error rate of such conversions. Our main contributions are: (1) providing an openly available benchmark dataset for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ag-gipp/MathMLben
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.