TL;DR
This paper investigates how characteristics of residential load demand data influence the effectiveness and recommendations of various cluster validation indices, aiding in selecting suitable indices for clustering analysis.
Contribution
It reveals how dataset features affect CVI recommendations and identifies data characteristics that favor or hinder specific validation indices.
Findings
Data characteristics significantly influence CVI recommendations.
Certain features of load demand data determine the suitability of specific CVIs.
Guidelines are provided for selecting appropriate CVIs based on data features.
Abstract
With the inclusion of smart meters, electricity load consumption data can be fetched for individual consumer buildings at high temporal resolutions. Availability of such data has made it possible to study daily load demand profiles of the households. Clustering households based on their demand profiles is one of the primary, yet a key component of such analysis. While many clustering algorithms/frameworks can be deployed to perform clustering, they usually generate very different clusters. In order to identify the best clustering results, various cluster validation indices (CVIs) have been proposed in the literature. However, it has been noticed that different CVIs often recommend different algorithms. This leads to the problem of identifying the most suitable CVI for a given dataset. Responding to the problem, this paper shows how the recommendations of validation indices are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
