Mining Block I/O Traces for Cache Preloading with Sparse Temporal Non-parametric Mixture of Multivariate Poisson
Lavanya Sita Tekumalla, Chiranjib Bhattacharyya

TL;DR
This paper introduces a novel Bayesian non-parametric mixture model for multivariate Poisson count data, leveraging sparsity to efficiently capture long-range temporal patterns in storage traces for improved cache preloading.
Contribution
It proposes the Sparse DP mixture of multivariate Poisson model, extending previous models with sparsity for better efficiency and effectiveness in mining long-range motifs in storage trace data.
Findings
Significant improvement in cache hitrates on benchmark traces.
Effective modeling of long-range temporal correlations in storage data.
Foundation for future data mining techniques in storage system optimization.
Abstract
Existing caching strategies, in the storage domain, though well suited to exploit short range spatio-temporal patterns, are unable to leverage long-range motifs for improving hitrates. Motivated by this, we investigate novel Bayesian non-parametric modeling(BNP) techniques for count vectors, to capture long range correlations for cache preloading, by mining Block I/O traces. Such traces comprise of a sequence of memory accesses that can be aggregated into high-dimensional sparse correlated count vector sequences. While there are several state of the art BNP algorithms for clustering and their temporal extensions for prediction, there has been no work on exploring these for correlated count vectors. Our first contribution addresses this gap by proposing a DP based mixture model of Multivariate Poisson (DP-MMVP) and its temporal extension(HMM-DP-MMVP) that captures the full covariance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
