Waveform Signal Entropy and Compression Study of Whole-Building Energy Datasets
Thomas Kriechbaumer, Hans-Arno Jacobsen

TL;DR
This study analyzes the entropy and compression of large-scale whole-building energy datasets, demonstrating how optimized encoding can significantly reduce storage needs while preserving data usability.
Contribution
It introduces a comprehensive benchmarking of file formats and compression methods for high-rate energy datasets, highlighting potential storage savings and data quality improvements.
Findings
Some datasets do not fully utilize measurement precision
Compression can reduce dataset size by up to 73%
Optimal file formats improve data accessibility and storage efficiency
Abstract
Electrical energy consumption has been an ongoing research area since the coming of smart homes and Internet of Things devices. Consumption characteristics and usages profiles are directly influenced by building occupants and their interaction with electrical appliances. Extracted information from these data can be used to conserve energy and increase user comfort levels. Data analysis together with machine learning models can be utilized to extract valuable information for the benefit of occupants themselves, power plants, and grid operators. Public energy datasets provide a scientific foundation to develop and benchmark these algorithms and techniques. With datasets exceeding tens of terabytes, we present a novel study of five whole-building energy datasets with high sampling rates, their signal entropy, and how a well-calibrated measurement can have a significant effect on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
