Clustering-based Aggregations for Prediction in Event Streams
Yorick Spenrath, Marwan Hassani, Boudewijn F. Van Dongen

TL;DR
This paper introduces CAPiES, an online clustering-based framework that balances prediction accuracy and individual usefulness in large-scale event streams like shopper behavior and invoice payments.
Contribution
The paper presents a novel online clustering framework, CAPiES, that improves prediction accuracy by aggregating entities, addressing the trade-off between accuracy and individual detail.
Findings
Effective in large-scale real-world scenarios
Balances accuracy with individual usefulness
Demonstrates trade-off in experimental evaluation
Abstract
Predicting the behaviour of shoppers provides valuable information for retailers, such as the expected spend of a shopper or the total turnover of a supermarket. The ability to make predictions on an individual level is useful, as it allows supermarkets to accurately perform targeted marketing. However, given the expected number of shoppers and their diverse behaviours, making accurate predictions on an individual level is difficult. This problem does not only arise in shopper behaviour, but also in various business processes, such as predicting when an invoice will be paid. In this paper we present CAPiES, a framework that focuses on this trade-off in an online setting. By making predictions on a larger number of entities at a time, we improve the predictive accuracy but at the potential cost of usefulness since we can say less about the individual entities. CAPiES is developed in an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Customer churn and segmentation · Complex Network Analysis Techniques
