Impact of Load Demand Dataset Characteristics on Clustering Validation   Indices

Mayank Jain; Mukta Jain; Tarek AlSkaif; and Soumyabrata Dev

arXiv:2108.01433·cs.CY·August 4, 2021

Impact of Load Demand Dataset Characteristics on Clustering Validation Indices

Mayank Jain, Mukta Jain, Tarek AlSkaif, and Soumyabrata Dev

PDF

1 Repo

TL;DR

This paper investigates how characteristics of residential load demand data influence the effectiveness and recommendations of various cluster validation indices, aiding in selecting suitable indices for clustering analysis.

Contribution

It reveals how dataset features affect CVI recommendations and identifies data characteristics that favor or hinder specific validation indices.

Findings

01

Data characteristics significantly influence CVI recommendations.

02

Certain features of load demand data determine the suitability of specific CVIs.

03

Guidelines are provided for selecting appropriate CVIs based on data features.

Abstract

With the inclusion of smart meters, electricity load consumption data can be fetched for individual consumer buildings at high temporal resolutions. Availability of such data has made it possible to study daily load demand profiles of the households. Clustering households based on their demand profiles is one of the primary, yet a key component of such analysis. While many clustering algorithms/frameworks can be deployed to perform clustering, they usually generate very different clusters. In order to identify the best clustering results, various cluster validation indices (CVIs) have been proposed in the literature. However, it has been noticed that different CVIs often recommend different algorithms. This leads to the problem of identifying the most suitable CVI for a given dataset. Responding to the problem, this paper shows how the recommendations of validation indices are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jain15mayank/data-characteristics-impact-on-CVIs
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.