Cohesiveness Relationships to Empower Keyword Search on Tree Data on the Web
Aggeliki Dimitriou, Ananya Dass, Dimitri Theodoratos

TL;DR
This paper introduces cohesive keyword queries for tree data, enhancing expressiveness and result quality while maintaining simplicity, and presents an efficient evaluation algorithm that scales well on large datasets.
Contribution
It proposes a novel cohesive keyword query model with formal semantics and an efficient stack-based evaluation algorithm for tree data.
Findings
Outperforms previous filtering semantics in quality
Efficiently evaluates queries with up to 20 keywords
Scales smoothly on large datasets
Abstract
Keyword search is the most popular querying technique on semistructured data. Keyword queries are simple and con- venient. However, as a consequence of their imprecision, the quality of their answers is poor and the existing algorithms do not scale satisfactorily. In this paper, we introduce the novel concept of cohesive keyword queries for tree data. Intuitively, a cohesiveness relationship on keywords indicates that they should form a cohesive whole in a query result. Cohesive keyword queries allow term nesting and keyword repetition. Although more expressive, they are as simple as flat keyword queries. We provide formal semantics for cohesive keyword queries rank- ing query results on the proximity of the keyword instances. We design a stack based algorithm which efficiently evaluates cohesive keyword queries. Our experiments demonstrate that our approach outperforms in quality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Database Systems and Queries · Advanced Text Analysis Techniques
