Bayesian Mixture Models for Frequent Itemset Discovery
Ruefei He, Jonathan Shapiro

TL;DR
This paper introduces Bayesian mixture models with Dirichlet priors for more interpretable frequent itemset discovery in binary data, demonstrating improved performance and efficiency over non-Bayesian models through experiments.
Contribution
It develops two Bayesian mixture models with Dirichlet priors for transaction data mining, enhancing interpretability and performance over previous non-Bayesian approaches.
Findings
Bayesian models outperform non-Bayesian mixture models in benchmark tests.
Variational algorithm is faster, Gibbs sampling yields more accurate results.
Dirichlet process model adapts complexity automatically.
Abstract
In binary-transaction data-mining, traditional frequent itemset mining often produces results which are not straightforward to interpret. To overcome this problem, probability models are often used to produce more compact and conclusive results, albeit with some loss of accuracy. Bayesian statistics have been widely used in the development of probability models in machine learning in recent years and these methods have many advantages, including their abilities to avoid overfitting. In this paper, we develop two Bayesian mixture models with the Dirichlet distribution prior and the Dirichlet process (DP) prior to improve the previous non-Bayesian mixture model developed for transaction dataset mining. We implement the inference of both mixture models using two methods: a collapsed Gibbs sampling scheme and a variational approximation algorithm. Experiments in several benchmark problems…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Bayesian Methods and Mixture Models · Rough Sets and Fuzzy Logic
