Natural Language Premise Selection: Finding Supporting Statements for   Mathematical Text

Deborah Ferreira; Andre Freitas

arXiv:2004.14959·cs.CL·May 1, 2020·5 cites

Natural Language Premise Selection: Finding Supporting Statements for Mathematical Text

Deborah Ferreira, Andre Freitas

PDF

Open Access 1 Repo

TL;DR

This paper introduces the natural premise selection task to identify supporting statements in mathematical texts, providing a dataset and analyzing the challenges faced by current NLP models in understanding mathematical discourse.

Contribution

It proposes a new NLP task for mathematical texts, introduces the NL-PS dataset, and evaluates baseline models to highlight interpretation challenges.

Findings

01

Baseline models struggle with the complexity of mathematical language.

02

The NL-PS dataset enables evaluation of premise selection methods.

03

Understanding mathematical discourse remains a significant challenge for NLP.

Abstract

Mathematical text is written using a combination of words and mathematical expressions. This combination, along with a specific way of structuring sentences makes it challenging for state-of-art NLP tools to understand and reason on top of mathematical discourse. In this work, we propose a new NLP task, the natural premise selection, which is used to retrieve supporting definitions and supporting propositions that are useful for generating an informal mathematical proof for a particular statement. We also make available a dataset, NL-PS, which can be used to evaluate different approaches for the natural premise selection task. Using different baselines, we demonstrate the underlying interpretation challenges associated with the task.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

debymf/nl-ps
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Mathematics, Computing, and Information Processing