Nonoverlapping (delta, gamma)-approximate pattern matching
Youxi Wu, Bojing Jian, Yan Li, He Jiang, Xindong Wu

TL;DR
This paper introduces a novel nonoverlapping approximate pattern matching method using (delta, gamma)-distance, improving accuracy over Hamming distance by effectively handling local and global pattern deviations.
Contribution
It presents the NDP scheme with (delta, gamma)-distance, transforming the problem into a local approximate Nettree and proposing the NetNDP algorithm with pruning techniques.
Findings
NetNDP outperforms existing methods in accuracy and efficiency.
The approach effectively handles local and global pattern deviations.
Experimental results validate the algorithm's superior performance.
Abstract
Pattern matching can be used to calculate the support of patterns, and is a key issue in sequential pattern mining (or sequence pattern mining). Nonoverlapping pattern matching means that two occurrences cannot use the same character in the sequence at the same position. Approximate pattern matching allows for some data noise, and is more general than exact pattern matching. At present, nonoverlapping approximate pattern matching is based on Hamming distance, which cannot be used to measure the local approximation between the subsequence and pattern, resulting in large deviations in matching results. To tackle this issue, we present a Nonoverlapping Delta and gamma approximate Pattern matching (NDP) scheme that employs the (delta, gamma)-distance to give an approximate pattern matching, where the local and the global distances do not exceed delta and gamma, respectively. We first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Network Packet Processing and Optimization · Advanced Database Systems and Queries
