Universal Non-Intrusive Load Monitoring (UNILM) Using Filter Pipelines,   Probabilistic Knapsack, and Labelled Partition Maps

Alejandro Rodriguez-Silva; Stephen Makonin

arXiv:1907.06299·eess.SP·July 26, 2019

Universal Non-Intrusive Load Monitoring (UNILM) Using Filter Pipelines, Probabilistic Knapsack, and Labelled Partition Maps

Alejandro Rodriguez-Silva, Stephen Makonin

PDF

1 Repo

TL;DR

This paper introduces a universal, unsupervised NILM method that uses advanced filtering, probabilistic modeling, and labeling techniques to accurately disaggregate energy data across different regions.

Contribution

It presents a novel combination of filter pipelines, probabilistic knapsack, and partition maps for region-independent, unsupervised appliance energy disaggregation.

Findings

01

Achieves 93.7% accuracy in energy tracking

02

Works across various countries and appliance types

03

Handles complex appliance signals effectively

Abstract

Being able to track appliances energy usage without the need of sensors can help occupants reduce their energy consumption to help save the environment all while saving money. Non-intrusive load monitoring (NILM) tries to do just that. One of the hardest problems NILM faces is the ability to run unsupervised -- discovering appliances without prior knowledge -- and to run independent of the differences in appliance mixes and operational characteristics found in various countries and regions. We propose a solution that can do this with the use of an advanced filter pipeline to preprocess the data, a Gaussian appliance model with a probabilistic knapsack algorithm to disaggregate the aggregate smart meter signal, and partition maps to label which appliances were found and how much energy they use no matter the country/region. Experimental results show that relatively complex appliance…

Tables2

Table 1. TABLE I: Experimental Run-Times

Preccess/Step	Time (sec)	Time (min)
1a. Median Filter	1.6	0.0
1b. Bilateral Filter	12.7	0.2
1c. Anisotropic Filter	0.1	0.0
1d. Edge-Preserving Filter	875.4	14.6
1e. Edge Sharpening	0.8	0.0
1. Filter Pipeline	890.6	14.8
2. Appliance Tracking	1.5	0.0
3. Appliance Labelling	0.2	0.0
Total Run-Time	892.3	14.9

Table 2. TABLE II: Energy Truth/Filtered/Tracked

Appliance	G.Truth	Filtered	Est/Tracked	Truth vs Est
Clothes Dryer	2.753 kWh	2.753 kWh	2.604 kWh	94.5%
Fridge	0.063 kWh	0.065 kWh	0.055 kWh	87.3%
Furnace	0.174 kWh	0.167 kWh	0.144 kWh	82.8%
Aggregate	2.990 kWh	2.961 kWh	2.803 kWh	93.7%

Equations15

y_{t} = f (z_{t}),

y_{t} = f (z_{t}),

α_{i} = {N_{P 0}, N_{P 1}, N_{D 0}, N_{D 1}, y},

α_{i} = {N_{P 0}, N_{P 1}, N_{D 0}, N_{D 1}, y},

i = 1 \sum m j \in N_{i} \sum p_{ij} x_{ij}

i = 1 \sum m j \in N_{i} \sum p_{ij} x_{ij}

i = 1 \sum m j \in N_{i} \sum w_{ij} x_{ij} \leq c,

j \in N_{i} \sum x_{ij} = 1, i = 1, ..., m,

x_{ij} \in {0, 1}, i = 1, ..., m, j \in N_{i},

x_{t}, p = MCKP (∣Δ y ∣, U),

x_{t}, p = MCKP (∣Δ y ∣, U),

e n er g y = \int_{0}^{T} \frac{y _{t} \times \frac{1}{3600}}{1000} d t .

e n er g y = \int_{0}^{T} \frac{y _{t} \times \frac{1}{3600}}{1000} d t .

a cc u r a cy = \frac{e n er g y ( y )}{e n er g y ( z )} \times 100

a cc u r a cy = \frac{e n er g y ( y )}{e n er g y ( z )} \times 100

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

compsust/KP-NILM
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Universal Non-Intrusive Load Monitoring (UNILM) Using Filter Pipelines, Probabilistic Knapsack, and Labelled Partition Maps

††thanks: Research supported by NSERC Discovery Grant RGPIN-2018-06192.

Alejandro Rodriguez-Silva

School of Engineering Science

*Simon Fraser University

*Burnaby, Canada

Email: [email protected]

Stephen Makonin

School of Engineering Science

*Simon Fraser University

*Burnaby, Canada

Email: [email protected]

Abstract

Being able to track appliances energy usage without the need of sensors can help occupants reduce their energy consumption to help save the environment all while saving money. Non-intrusive load monitoring (NILM) tries to do just that. One of the hardest problems NILM faces is the ability to run unsupervised – discovering appliances without prior knowledge – and to run independent of the differences in appliance mixes and operational characteristics found in various countries and regions. We propose a solution that can do this with the use of an advanced filter pipeline to preprocess the data, a Gaussian appliance model with a probabilistic knapsack algorithm to disaggregate the aggregate smart meter signal, and partition maps to label which appliances were found and how much energy they use no matter the country/region. Experimental results show that relatively complex appliance signals can be tracked accounting for 93.7% of the total aggregate energy consumed.

Index Terms:

unsupervised learning, disaggregation, non-intrusive load monitoring, NILM, knapsack, labelled partition maps, Gaussian models, smart meter, smart grid

I Introduction

Disaggregation is a difficult, ill-posed problem that uses statistical models and algorithms to determine the unknown components that were used to sum the known aggregate value.

Disaggregating power/energy data is known as non-intrusive load monitoring (NILM) — using a building’s smart meter to track or sense appliance usage (see Figure 1) [1]. NILM uses computation, without the intrusion or cost of sensors, to model and understand the power consumption of loads/appliances in houses or buildings with the goal of reducing energy consumption. Appliance-level data inferred by NILM can be used by occupants to help make informed choices (that fit their lifestyle) as to how they want to conserve. Ultimately this is a win for their pocketbook and the environment.

I-A Mitigation by Conservation

In 2012, Natural Resources Canada (NRCan) reported that Canadian households accounted for 14.6% of total energy-related greenhouse gas emissions. Despite various government incentives and more energy-efficient appliances on the market, Canadian households have increased their energy consumption by 4.5% in 2013 (97.5 GJ/household) from 2011 (93.3 GJ/household).

A recent study [2] showed that 80% of participants want to have access to disaggregation data (i.e. knowing how their appliances consume energy) and believed that everyone should have access to this information. This study also shows when load disaggregation information is made available to occupants, those occupants can reduce their energy consumption by an average of 14% by changing appliance use habits. In fact, more research is showing that in order for occupants to reduce (on average) their energy consumption by more than 9%, real-time appliance-specific consumption information is needed [3, 4].

If we look at the COP21 Paris Climate Agreement, Canada’s commitment is to cut carbon emissions by 30% below 2005 levels by 2030 which is similar to that of the USA. If we solve the difficult problem of NILM in five years, we will have the means to reduce the amount of carbon emissions possible for homes and buildings with ample time to deploy it.

As an example, we will use numbers from the USA Environmental Protection Agency (EPA) as they are readily available and contain data as recent as 2015. In 2015, EPA reported a 12% reduction in carbon emissions, to date. This leaves 16% still needing to be reduced (or 960Mt of carbon). The EPA reports households (including commercial) emit 847Mt of carbon/year, which represents 12% of the total annual emissions. This means that from this economic sector a further 115Mt of carbon would need to be reduced to meet USA targets. We know that households can reduce their energy consumption by an average of 14% with real-time appliance-specific feedback. This 14% reduction is equivalent to 118.6Mt of carbon/year which is beyond the 115Mt reduction needed. This means the reduction goal set for the household and commercial economic sector has been met.

I-B Cost Benefit Case Study

If we can provide appliance-specific consumption information using NILM, then we can make use of the digital “smart” meters installed on almost every household in Canada. Figure 2 shows a two-year study that compares the cost of equipping a house with sensors vs. a NILM system.

To realize a small saving in energy conservation would require decades of savings (in this case 3.5) to payback to cost of purchase and installation of IoT sensors. Note, this cost is conservative as it only takes into account just the cost of 24-meter sensors and data logger — not the cost of having a professional electrician install the sensors and logger, nor does it take in account the energy costs of run the sensors and data logger continuously.

Many households would find the cost of these sensor systems unaffordable resulting in an adoption barrier. With a NILM system, energy savings can be realized immediately without an investment in sensors.

I-C Research Contributions

As a result of this work we have made the following contortions and provided a state-of-the-art NILM algorithm.

We show why NILM to a useful problem to solve to reduce energy demand and reach sustainability goals. 2. 2.

We demonstrate how a filter pipeline can create a clean, sharp signal. 3. 3.

We have designed a NILM algorithm that does not require prior training (nor uses general appliance models). 4. 4.

We show how using a labelled partition map can allow disaggregation independent of country/region with very different appliance mixes and operational characteristics.

II Background

The first peer-reviewed NILM publication was in 1992 [1] and with the recent release of publicly available datasets [6, 5, 7, 8] there is a resurgence in interest in solving NILM.

It is clear that at best NILM is a semi-supervised problem as labelled data is needed to properly identify what appliances have been disaggregated. Setting aside the labelling problem, the disaggregation part of NILM can be potentially solved via an unsupervised learning algorithm. Thus, Unsupervised NILM refers to this part and sets aside the labelling problem as something to be solved separately, perhaps through some additional algorithm or occupancy feedback system.

Supervised learning solutions are proven to be accurate [9], but brittle, making them undeployable in a real world situation. Although, deep learning methods [10, 11, 12] may show promise, if enough data is available. As a result, all serious algorithmic-based NILM research is now focused on an unsupervised NILM solution.

In terms of using advanced filters for preprocessing data there has been some work done. However, only one advanced filter is ever used [13, 14].

II-A General Model Tuning

Some NILM algorithms [15, 16] require priors to build a general model then tune it to a specific house. Some success with houses in the same country/region. However, this has not been proven to be successful inter-country/region; e.i., having a model of a dishwasher in the UK will not work for disaggregating dishwasher in USA. This method is often referred to as transfer learning [16].

II-B Online Learning

The goal of online learning is to discover and create appliance models without the use of priors. This is an ideal solution as it requires no training and can adapt to different appliance mixes and different operational characteristics that are country/region dependent. There has not been a concerted effort to solve NILM in this fashion as it would be considerably harder to do when compared to a transfer learning solution. Our proposed UNILM method attempts to solve NILM using this approach by combining the probability of Gaussian distributions and the optimization of the Knapsack Problem. Using Knapsack for NILM has been looked at before, albeit as part of a larger genetic algorithm solution [17].

III Methodology

Our proposed solution is considered universal. Universal NILM (UNILM) is both unsupervised and transferable. Unsupervised as it relies on no prior trained models as it learns online. Transferable as it can learn appliances in different houses without having prior information about each house.

III-A Prepossessing via a Filter Pipeline

We propose using a filter pipeline (see Figure 3) for prepossessing the signal as each filter cannot accomplish cleaning a signal with sharp edges and flat steady-states on its own. We use a standard median filter built into Matlab or Python/Numpy (or see [18]) for this first step to remove large spikes (e.g., from fridge compressor start-up). This is followed by sending the signal to a bilateral filter [19], then an anisotropic diffusion filter [20]. Next, we use a 2D image processing edge preserving filter that was adapted to process 1D signals as demonstrated in [21]. In the final step we edge sharpen similar to [14]. Figure 4 shows the results of filtering a 1Hz aggregate power signal.

We define a filtered signal as:

[TABLE]

where $z_{t}$ is the raw signal at time $t$ , $y_{t}$ is the resulting filtered signal after calling the filter pipeline function. Events are determined by $\Delta y$ which is the derivative of $y_{t}-y_{t-1}$ , where an OFF event is $\Delta y<s$ and an ON event is $\Delta y>s$ , else no event has occurred (i.e., seady-state). Variable $s$ is a minimum threshold step to determine if an event has occurred. For our proposed UNILM algorithm we set to $s=\pm 60$ W.

III-B Appliance Database & Probabilistic Knapsack

We have an appliance database $\mathbf{A}=\{\alpha_{1},\alpha_{2},...,\alpha_{M}\}$ of $M$ appliances found. At time $t_{0}$ $\mathbf{A}=\varnothing$ (i.e., is empty). Over time as appliances are found, $\mathbf{A}$ will grow in length; therefore, as each new appliance is discovered we have $M=M+1$ .

Each appliance $\alpha_{i}$ we define a model:

[TABLE]

where $\mathcal{N}_{P1}$ is a Gaussian PDF of the ON power demand in Watts, $\mathcal{N}_{D0}$ is a Gaussian PDF of the OFF duration in minutes, $\mathcal{N}_{D1}$ is a Gaussian PDF of the ON duration in minutes, Gaussian PDFs are defined as $\mathcal{N}(\mu,\,\sigma^{2})$ , and $\mathbf{y}$ is the power demand trace of the $\alpha_{i}$ from $1...T$ . Note that $\mathcal{N}_{P0}$ is a Gaussian PDF of the OFF power demand as some appliances remain in standby mode and still consume power. Our current proposed UNILM solution does not use $\mathcal{N}_{P0}$ ; however, future algorithms will. Note that we track the minimum power ON (i.e., the power from constantly-ON appliances) as $\check{y}$ .

To help choose what set of appliances are ON, we created a probabilistic version of the multiple-choice knapsack problem (MCKP) [22]. MCKP is defined as:

[TABLE]

where we have $N_{1},...,N_{m}$ items, each item $j\in N_{i}$ has a profit of $p_{ij}$ and weight of $w_{ij}$ , a knapsack of capacity $c$ , and binary variable $x_{ij}=1$ if and only if item $j$ is chosen in class $N_{i}$ . Notes that MCKP notation is specific to [22] and is not used in other equations defined in our proposed UNILM algorithm.

Using the appliance models defined above we select the multiple-choice values for each knapsack item (i.e., appliance):

[TABLE]

where $\mathbf{x}_{t}$ is a binary vector of length $M$ (when $\mathbf{x}_{t}[i]$ is $1$ then appliance $\alpha_{i}$ is ON, else [math] is OFF), $p$ denotes the profit between 0–100, and $\mathbf{U}$ is a vector of length $M$ where each $\mathbf{u}_{i}$ is a vector of possible power values to choose from the $i$ -th appliance which represent three standard deviations of integer values in $\mathcal{N}_{P1}$ .

III-C Unsupervised Online Appliance Learning & Tracking

We combine the probabilistic features in Gaussian distribution $f_{N}()$ in the appliances models and the optimization of MCKP in Algorithm 1 to find and track appliances online without prior knowledge using.

III-D Transfer Learning via Labelled Partition Maps

Our use of partition maps can be defined as $\mathbf{P}$ , a 3D matrix ( $D{\times}P{\times}R$ ), where $D$ is the maximum duration (in minutes) an appliance can operate for, $P$ is the maximum power demand value in Watts, $R$ is the number of different regions around the world, and $\mathbf{P}[d,p,r]$ would give a specific label; e.g., “clothes dryer”. See Figure 5 for a visual representation of one region that we used in testing.

There exists a set of labels $\mathbf{L}$ . Each appliance in $\mathbf{A}$ is assigned a label resulting in a binary vector $\mathbf{l_{j}}$ of length $M$ , where $1$ indicates appliance $\alpha_{i}$ is assigned this label, else [math]; with the restriction that appliance $\alpha_{i}$ can only be assigned to one and only one label. We can merge appliances from our database together if they may have been assigned the same label for the partition map.

IV Experiments

IV-A Experimental Setup

We take data from House-1, Block-1, (comprised of nine day’s worth of data) of the RAE dataset [8] which is sampled at 1Hz. Our disaggregator was coded in Python 3.6. We chose Python because of its array manipulation capabilities, digital signal processing libraries; moreover, its convenience when coding rapid prototypes that can get quickly translated to faster coding languages (e.g., C). All tests ran on a Mac Pro (2017 model) with a 2.3GHz Intel Core i5 processor and 8GB of memory. Furthermore, we run all our tests in one day’s worth of data without any prior knowledge.

IV-B Experimental Results

To preprocess/filter and disaggregate 5400 samples took a total run-time of 14.9 minutes (see Table I).

Figure 6 shows the individual appliances, ground truth (top) and disaggregated (middle). We can observe that the disaggregator detected four appliances (one more appliance than in the ground truth). Nevertheless, during the partition map stage, we merged these two unlabelled appliances (coloured blue and red) into one labelled appliance – the clothes dryer – because in the partition map they have resolve to the same label.

Further, the output from the disaggregator (see Figure 6), ignored the clothes dryer from sample 2620 to sample 3220. It was not able to track the approximately 200W operational state the clothes dryer was running in. Therefore, the disaggregator ignored this OFF event. Although the system was not able to track this part of the signal, the power consumption from the clothes dryer at that interval was very small and did not considerably affect the overall accuracy score.

To determine energy tracked, we integrate our 1Hz power (in Watts) samples to energy measured in kWh, as such:

[TABLE]

Figure 6 (bottom) depicts both how close the total energy tracked was as compared to the raw ground truth aggregated signal. The system was able to track 93.7% of the total aggregate energy without the use of prior information (see Table II). This accuracy measure is similar normalized disaggregation error (NDE) [23, 24]. We define the accuracy measure as:

[TABLE]

Using the same nomenclature defined in previous sections. Some of the contributing factored to a lower accuracy for the fridge (87.3%) was that fact that the ON-spikes from the compressor were filtered out. In reality the energy contributions from these ON-spikes is negligible, but for the small sample period in this case. Lower accuracy for the furnace (82.8%) may have been caused by the fact that it has a constantly-ON power reading of 44W.

V Conclusions

We have demonstrated a prototype universal NILM solution that can track relatively complex appliance signals without priors. Our experiment was able to assign 93.7% of the total aggregate energy consumed to the appliances it tracked. Although these experiments may be preliminary, they show promise for a NILM solution that works independent of country/region and without prior knowledge for modelling appliances. Disaggregation solutions (such as UNILM) could break the economic divide allowing everyone, no matter their socioeconomic situation, to participate in energy conservation.

Bibliography24

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] G. Hart, “Nonintrusive appliance load monitoring,” Proceedings of the IEEE , vol. 80, no. 12, pp. 1870–1891, Dec 1992.
2[2] P. Chakravarty and A. Gupta, “Impact of energy disaggregation on consumer behavior,” in UC Berkeley: Behavior, Energy and Climate Change Conference , 2013.
3[3] K. Ehrhardt-Martinez, K. Donnelly, S. Laitner et al. , “Advanced metering initiatives and residential feedback programs: a meta-review for household electricity-saving opportunities,” in Report Number E 105 . ACEEE Washington, DC, 2010.
4[4] K. Armel, A. Gupta, G. Shrimali, and A. Albert, “Is disaggregation the holy grail of energy efficiency? the case of electricity,” Energy Policy , vol. 52, pp. 213–234, 2013.
5[5] S. Makonin, B. Ellert, I. Bajić, and F. Popowich, “Electricity, water, and natural gas consumption of a residential house in Canada from 2012 to 2014,” Scientific Data , vol. 3, no. 160037, pp. 1–12, 06 2016.
6[6] J. Kolter and M. Johnson, “REDD: A public data set for energy disaggregation research,” in Workshop on Data Mining Applications in Sustainability (SIGKDD), San Diego, CA , vol. 25, 2011, pp. 59–62.
7[7] D. Murray, L. Stankovic, and V. Stankovic, “An electrical load measurements dataset of United Kingdom households from a two-year longitudinal study,” Scientific Data , vol. 4, pp. 160 122 EP –, 01 2017.
8[8] S. Makonin, Z. Wang, and C. Tumpach, “RAE: The Rainforest Automation energy dataset for smart grid meter data analysis,” data , vol. 3, no. 1, p. 8, 2018.