Automated Discovery of Mathematical Definitions in Text with Deep Neural Networks
Natalia Vanetik, Marina Litvak, Sergey Shevchuk, and Lior Reznik

TL;DR
This paper introduces a deep learning approach using CNN and LSTM to automatically identify one-sentence mathematical definitions in texts, highlighting the importance of syntactic features and domain-specific challenges.
Contribution
It presents a novel dataset for mathematical definition extraction and demonstrates the effectiveness of CNN and LSTM models with syntactic features for this task.
Findings
CNN outperforms other models on the dataset
Syntactic features improve detection accuracy
Cross-domain learning is less effective for mathematical definitions
Abstract
Automatic definition extraction from texts is an important task that has numerous applications in several natural language processing fields such as summarization, analysis of scientific texts, automatic taxonomy generation, ontology generation, concept identification, and question answering. For definitions that are contained within a single sentence, this problem can be viewed as a binary classification of sentences into definitions and non-definitions. In this paper, we focus on automatic detection of one-sentence definitions in mathematical texts, which are difficult to separate from surrounding text. We experiment with several data representations, which include sentence syntactic structure and word embeddings, and apply deep learning methods such as the Convolutional Neural Network (CNN) and the Long Short-Term Memory network (LSTM), in order to identify mathematical definitions.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
MethodsTanh Activation · Memory Network · Sigmoid Activation · Long Short-Term Memory
