HUSP-SP: Faster Utility Mining on Sequence Data
Chunkai Zhang, Yuting Yang, Zilin Du, Wensheng Gan, and Philip S. Yu

TL;DR
This paper introduces HUSP-SP, an efficient algorithm for high-utility sequential pattern mining that uses a novel compact data structure and pruning strategies to significantly improve performance on large datasets.
Contribution
It proposes a new compact sequence projection structure and pruning strategies that enhance the efficiency of high-utility sequential pattern mining algorithms.
Findings
HUSP-SP outperforms existing algorithms in runtime and memory usage.
The seqPro structure effectively reduces search space and computation.
Experimental results demonstrate scalability and efficiency improvements.
Abstract
High-utility sequential pattern mining (HUSPM) has emerged as an important topic due to its wide application and considerable popularity. However, due to the combinatorial explosion of the search space when the HUSPM problem encounters a low utility threshold or large-scale data, it may be time-consuming and memory-costly to address the HUSPM problem. Several algorithms have been proposed for addressing this problem, but they still cost a lot in terms of running time and memory usage. In this paper, to further solve this problem efficiently, we design a compact structure called sequence projection (seqPro) and propose an efficient algorithm, namely discovering high-utility sequential patterns with the seqPro structure (HUSP-SP). HUSP-SP utilizes the compact seq-array to store the necessary information in a sequence database. The seqPro structure is designed to efficiently calculate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Rough Sets and Fuzzy Logic · Data Management and Algorithms
MethodsPruning
