A Heuristically Modified FP-Tree for Ontology Learning with Applications in Education
Safwan Shatnawi, Mohamed Medhat Gaber, Mihaela Cocea

TL;DR
This paper introduces a heuristically modified FP-Tree for ontology learning from unstructured text, utilizing DFA for concept extraction and rule-based pattern mining, which improves question-answering accuracy in educational contexts.
Contribution
It presents a novel heuristic approach combining DFA-based concept extraction and pattern mining for ontology learning, enhancing question-answering systems in education.
Findings
80% question answering success with our ontology
Compared to Text2Onto, our method significantly improves coverage
Latent Semantic Analysis aligns best with expert ratings
Abstract
We propose a heuristically modified FP-Tree for ontology learning from text. Unlike previous research, for concept extraction, we use a regular expression parser approach widely adopted in compiler construction, i.e., deterministic finite automata (DFA). Thus, the concepts are extracted from unstructured documents. For ontology learning, we use a frequent pattern mining approach and employ a rule mining heuristic function to enhance its quality. This process does not rely on predefined lexico-syntactic patterns, thus, it is applicable for different subjects. We employ the ontology in a question-answering system for students' content-related questions. For validation, we used textbook questions/answers and questions from online course forums. Subject experts rated the quality of the system's answers on a subset of questions and their ratings were used to identify the most appropriate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Service-Oriented Architecture and Web Services · Natural Language Processing Techniques
