Extracting Conceptual Knowledge from Natural Language Text Using Maximum Likelihood Principle
Shipra Sharma, Balwinder Sodhi

TL;DR
This paper introduces a probabilistic model to extract and learn underlying conceptual knowledge from natural language texts, enhancing the understanding and utility of domain-specific knowledge graphs.
Contribution
It proposes a novel Markovian stochastic model incorporating conceptual knowledge as parameters and applies EM-based algorithms to learn this knowledge from text.
Findings
Model successfully uncovers conceptual structures in various texts.
Experiments show the approach aligns with the hypothesis of underlying conceptual frameworks.
Results demonstrate the potential for improved knowledge graph construction and querying.
Abstract
Domain-specific knowledge graphs constructed from natural language text are ubiquitous in today's world. In many such scenarios the base text, from which the knowledge graph is constructed, concerns itself with practical, on-hand, actual or ground-reality information about the domain. Product documentation in software engineering domain are one example of such base texts. Other examples include blogs and texts related to digital artifacts, reports on emerging markets and business models, patient medical records, etc. Though the above sources contain a wealth of knowledge about their respective domains, the conceptual knowledge on which they are based is often missing or unclear. Access to this conceptual knowledge can enormously increase the utility of available data and assist in several tasks such as knowledge graph completion, grounding, querying, etc. Our contributions in this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Topic Modeling · Biomedical Text Mining and Ontologies
