Fast Linking of Mathematical Wikidata Entities in Wikipedia Articles Using Annotation Recommendation
Philipp Scharpf, Moritz Schubotz, Bela Gipp

TL;DR
This paper introduces AnnoMathTeX, an AI-powered system that significantly accelerates the annotation of mathematical formulas and identifiers in Wikipedia articles, enhancing mathematical knowledge base population.
Contribution
The paper presents a novel annotation recommendation system that speeds up mathematical formula and identifier linking in Wikipedia, with evaluation showing notable time savings and integration plans.
Findings
Speeded up annotation by 1.4 times for formulas
Achieved 2.4 times faster annotation for identifiers
Reverted contributions in 12% of articles and 33% of Wikidata items
Abstract
Mathematical information retrieval (MathIR) applications such as semantic formula search and question answering systems rely on knowledge-bases that link mathematical expressions to their natural language names. For database population, mathematical formulae need to be annotated and linked to semantic concepts, which is very time-consuming. In this paper, we present our approach to structure and speed up this process by supporting annotators with a system that suggests formula names and meanings of mathematical identifiers. We test our approach annotating 25 articles on https://en.wikipedia.org. We evaluate the quality and time-savings of the annotation recommendations. Moreover, we watch editor reverts and comments on Wikipedia formula entity links and Wikidata item creation and population to ground the formula semantics. Our evaluation shows that the AI guidance was able to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
