Direct Load Control of Thermostatically Controlled Loads Based on Sparse Observations Using Deep Reinforcement Learning
Frederik Ruelens, Bert J. Claessens, Peter Vrancx, Fred Spiessens, and, Geert Deconinck

TL;DR
This paper explores using deep reinforcement learning, specifically LSTM networks, to optimize demand response for thermostatically controlled loads with sparse observational data, outperforming other neural network architectures.
Contribution
It demonstrates that LSTM networks effectively extract features from sparse time-series data for near-optimal control policies in residential heating systems.
Findings
LSTM outperforms CNN in this scenario
Deep learning can handle sparse observations effectively
Recurrent neural networks are suitable for demand response tasks
Abstract
This paper considers a demand response agent that must find a near-optimal sequence of decisions based on sparse observations of its environment. Extracting a relevant set of features from these observations is a challenging task and may require substantial domain knowledge. One way to tackle this problem is to store sequences of past observations and actions in the state vector, making it high dimensional, and apply techniques from deep learning. This paper investigates the capabilities of different deep learning techniques, such as convolutional neural networks and recurrent neural networks, to extract relevant features for finding near-optimal policies for a residential heating system and electric water heater that are hindered by sparse observations. Our simulation results indicate that in this specific scenario, feeding sequences of time-series to an LSTM network, which is a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
