Clustering Rooftop PV Systems via Probabilistic Embeddings
Kutay B\"olat, Tarek Alskaif, Peter Palensky, Simon Tindemans

TL;DR
This paper introduces a probabilistic embedding-based clustering method for rooftop PV systems, effectively capturing their power generation patterns and uncertainties to improve grouping, robustness, and missing data imputation in large-scale datasets.
Contribution
It presents a novel probabilistic embedding framework that encodes PV system characteristics and uncertainties, enhancing clustering accuracy and robustness over traditional physics-based methods.
Findings
Outperforms physics-based baseline in representativeness and robustness
Supports reliable missing-value imputation
Provides practical hyperparameter tuning guidance
Abstract
As the number of rooftop photovoltaic (PV) installations increases, aggregators and system operators are required to monitor and analyze these systems, raising the challenge of integration and management of large, spatially distributed time-series data that are both high-dimensional and affected by missing values. In this work, a probabilistic entity embedding-based clustering framework is proposed to address these problems. This method encodes each PV system's characteristic power generation patterns and uncertainty as a probability distribution, then groups systems by their statistical distances and agglomerative clustering. Applied to a multi-year residential PV dataset, it produces concise, uncertainty-aware cluster profiles that outperform a physics-based baseline in representativeness and robustness, and support reliable missing-value imputation. A systematic hyperparameter study…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSolar Radiation and Photovoltaics · Smart Grid Energy Management · Photovoltaic System Optimization Techniques
