Combining Named Entities with WordNet and Using Query-Oriented Spreading Activation for Semantic Text Search
Vuong M. Ngo, Tru H. Cao, Tuan M.V. Le

TL;DR
This paper introduces an ontology-based vector space model that enhances semantic text search by integrating named entities and WordNet features, and employs a query-oriented spreading activation algorithm to improve retrieval accuracy.
Contribution
It presents a novel generalized vector space model that combines ontological features of named entities and WordNet, along with a query expansion technique using spreading activation.
Findings
Model outperforms keyword-based search by 42.5% in MAP.
Incorporating ontologies improves search precision over using only WordNet or named entities.
Experimental results demonstrate significant accuracy gains in semantic retrieval.
Abstract
Purely keyword-based text search is not satisfactory because named entities and WordNet words are also important elements to define the content of a document or a query in which they occur. Named entities have ontological features, namely, their aliases, classes, and identifiers. Words in WordNet also have ontological features, namely, their synonyms, hypernyms, hyponyms, and senses. Those features of concepts may be hidden from their textual appearance. Besides, there are related concepts that do not appear in a query, but can bring out the meaning of the query if they are added. We propose an ontology-based generalized Vector Space Model to semantic text search. It exploits ontological features of named entities and WordNet words, and develops a query-oriented spreading activation algorithm to expand queries. In addition, it combines and utilizes advantages of different ontologies for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Web Data Mining and Analysis · Natural Language Processing Techniques
