Efficient Model Selection for Predictive Pattern Mining Model by Safe Pattern Pruning
Takumi Yoshida, Hiroyuki Hanada, Kazuya Nakagawa, Kouichi Taji, Koji, Tsuda, Ichiro Takeuchi

TL;DR
This paper introduces Safe Pattern Pruning (SPP), a novel method to efficiently select predictive patterns in structured data models, significantly reducing the computational complexity in pattern mining tasks.
Contribution
The paper proposes the SPP method to effectively prune patterns during model building, addressing the exponential growth challenge in predictive pattern mining.
Findings
SPP reduces the number of patterns needed for accurate models.
Experimental results show improved efficiency in regression and classification tasks.
SPP is applicable to sets, graphs, and sequences.
Abstract
Predictive pattern mining is an approach used to construct prediction models when the input is represented by structured data, such as sets, graphs, and sequences. The main idea behind predictive pattern mining is to build a prediction model by considering substructures, such as subsets, subgraphs, and subsequences (referred to as patterns), present in the structured data as features of the model. The primary challenge in predictive pattern mining lies in the exponential growth of the number of patterns with the complexity of the structured data. In this study, we propose the Safe Pattern Pruning (SPP) method to address the explosion of pattern numbers in predictive pattern mining. We also discuss how it can be effectively employed throughout the entire model building process in practical data analysis. To demonstrate the effectiveness of the proposed method, we conduct numerical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications
MethodsPruning
