TKUS: Mining Top-K High-Utility Sequential Patterns

Chunkai Zhang; Zilin Du; Wensheng Gan; Philip S. Yu

arXiv:2011.13454·cs.DB·November 30, 2020·6 cites

TKUS: Mining Top-K High-Utility Sequential Patterns

Chunkai Zhang, Zilin Du, Wensheng Gan, Philip S. Yu

PDF

Open Access

TL;DR

This paper introduces TKUS, a novel algorithm for top-k high-utility sequential pattern mining that efficiently finds the most useful patterns without needing a predefined utility threshold, outperforming previous methods.

Contribution

The paper formulates the top-k HUSPM problem and proposes TKUS, an efficient algorithm with new strategies to improve search space reduction and scalability.

Findings

01

TKUS outperforms existing algorithms in efficiency.

02

The proposed strategies significantly reduce search space.

03

Experimental results validate the effectiveness of TKUS.

Abstract

High-utility sequential pattern mining (HUSPM) has recently emerged as a focus of intense research interest. The main task of HUSPM is to find all subsequences, within a quantitative sequential database, that have high utility with respect to a user-defined minimum utility threshold. However, it is difficult to specify the minimum utility threshold, especially when database features, which are invisible in most cases, are not understood. To handle this problem, top-k HUSPM was proposed. Up to now, only very preliminary work has been conducted to capture top-k HUSPs, and existing strategies require improvement in terms of running time, memory consumption, unpromising candidate filtering, and scalability. Moreover, no systematic problem statement has been defined. In this paper, we formulate the problem of top-k HUSPM and propose a novel algorithm called TKUS. To improve efficiency, TKUS…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Mining Algorithms and Applications · Rough Sets and Fuzzy Logic · Data Management and Algorithms