Explainable Fuzzy Utility Mining on Sequences
Wensheng Gan, Zilin Du, Weiping Ding, Chunkai Zhang, and Han-Chieh, Chao

TL;DR
This paper introduces PGFUM, a novel fuzzy utility mining method for sequence data that enhances explainability and efficiency, providing human-understandable models for decision-making in data science.
Contribution
The study proposes a new fuzzy utility mining algorithm for sequences, with compressed data structures and pruning strategies, improving interpretability and computational performance over existing methods.
Findings
PGFUM achieves higher efficiency in runtime and memory usage.
It produces human-explainable, linguistically meaningful high-utility sequences.
Experimental results outperform the existing PFUS method.
Abstract
Fuzzy systems have good modeling capabilities in several data science scenarios, and can provide human-explainable intelligence models with explainability and interpretability. In contrast to transaction data, which have been extensively studied, sequence data are more common in real-life applications. To obtain a human-explainable data intelligence model for decision making, in this study, we investigate explainable fuzzy-theoretic utility mining on multi-sequences. Meanwhile, a more normative formulation of the problem of fuzzy utility mining on sequences is formulated. By exploring fuzzy set theory for utility mining, we propose a novel method termed pattern growth fuzzy utility mining (PGFUM) for mining fuzzy high-utility sequences with linguistic meaning. In the case of sequence data, PGFUM reflects the fuzzy quantity and utility regions of sequences. To improve the efficiency and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Rough Sets and Fuzzy Logic · Imbalanced Data Classification Techniques
