Mining Patterns with a Balanced Interval

Edgar de Graaf Joost Kok Walter Kosters

arXiv:0705.1110·cs.AI·May 23, 2007

Mining Patterns with a Balanced Interval

Edgar de Graaf Joost Kok Walter Kosters

PDF

Open Access

TL;DR

This paper introduces methods to identify patterns that occur at regular, balanced intervals in datasets, using statistical measures like standard deviation and average to improve pattern mining efficiency.

Contribution

It proposes a new approach for mining balanced interval patterns using standard deviation and average, along with a simplified pruning method for better threshold estimation.

Findings

01

Effective identification of balanced interval patterns.

02

Simplified pruning approach improves pattern selection.

03

Applicable to various datasets with periodic behaviors.

Abstract

In many applications it will be useful to know those patterns that occur with a balanced interval, e.g., a certain combination of phone numbers are called almost every Friday or a group of products are sold a lot on Tuesday and Thursday. In previous work we proposed a new measure of support (the number of occurrences of a pattern in a dataset), where we count the number of times a pattern occurs (nearly) in the middle between two other occurrences. If the number of non-occurrences between two occurrences of a pattern stays almost the same then we call the pattern balanced. It was noticed that some very frequent patterns obviously also occur with a balanced interval, meaning in every transaction. However more interesting patterns might occur, e.g., every three transactions. Here we discuss a solution using standard deviation and average. Furthermore we propose a simpler approach for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Mining Algorithms and Applications · Time Series Analysis and Forecasting · Advanced Database Systems and Queries