TUSQ: Targeted High-Utility Sequence Querying
Chunkai Zhang, Zilin Du, Quanjian Dai, Wensheng Gan, Jian Weng, and, Philip S. Yu

TL;DR
This paper introduces TUSQ, a novel algorithm for targeted high-utility sequence querying that incorporates utility measurement into sequence pattern discovery, outperforming existing methods in efficiency and effectiveness.
Contribution
The paper proposes TUSQ, a new algorithm that integrates utility into sequence querying, with innovative upper bounds and data structures for improved performance.
Findings
TUSQ outperforms baseline algorithms in runtime.
TUSQ uses novel upper bounds for efficient pruning.
TUSQ reduces memory consumption during querying.
Abstract
Significant efforts have been expended in the research and development of a database management system (DBMS) that has a wide range of applications for managing an enormous collection of multisource, heterogeneous, complex, or growing data. Besides the primary function (i.e., create, delete, and update), a practical and impeccable DBMS can interact with users through information selection, that is, querying with their targets. Previous querying algorithms, such as frequent itemset querying and sequential pattern querying (SPQ) have focused on the measurement of frequency, which does not involve the concept of utility, which is helpful for users to discover more informative patterns. To apply the querying technology for wider applications, we incorporate utility into target-oriented SPQ and formulate the task of targeted utility-oriented sequence querying. To address the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Data Management and Algorithms · Data Mining Algorithms and Applications
