Simulation Studies on Deep Reinforcement Learning for Building Control with Human Interaction
Donghwan Lee, Niao He, Seungjae Lee, Panagiota Karava, Jianghai Hu

TL;DR
This paper explores the use of deep reinforcement learning, specifically DDPG, for building climate control with occupant interaction, demonstrating its potential to handle complex, uncertain, and partially observable environments in simulation.
Contribution
It applies DDPG to building control problems with occupant interaction, addressing partial observability and uncertainties, and evaluates its performance through simulation studies.
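As a rough illustration of what DDPG involves, the sketch below shows two generic ingredients of the algorithm: the bootstrapped critic target and the soft (Polyak) update of the target-network weights. It is not the paper's implementation. The function names, the flat-list weight representation, and the default values of `gamma` and `tau` are assumptions made for this example.

```python
from typing import List


def td_target(reward: float, next_q: float, done: bool, gamma: float = 0.99) -> float:
    """Critic regression target y = r + gamma * Q'(s', mu'(s')).

    next_q stands for the target critic's value at the next state, evaluated
    with the target actor's action. The bootstrap term is dropped on terminal
    steps. The default gamma is an assumed value, not taken from the paper.
    """
    return reward + (0.0 if done else gamma * next_q)


def soft_update(target: List[float], online: List[float], tau: float = 0.005) -> List[float]:
    """Polyak averaging of target weights: theta' <- tau * theta + (1 - tau) * theta'.

    Weights are plain lists of floats here purely for illustration.
    The default tau is an assumed value, not taken from the paper.
    """
    return [tau * w + (1.0 - tau) * w_t for w, w_t in zip(online, target)]


if __name__ == "__main__":
    # One non-terminal transition with reward 1.0 and a next-state value of 10.0.
    print(td_target(1.0, 10.0, done=False, gamma=0.9))
    # Move the target weights halfway toward the online weights.
    print(soft_update([0.0, 1.0], [1.0, 1.0], tau=0.5))
```

In a full agent these two steps would sit inside a training loop that samples minibatches from an experience replay buffer. A small `tau` keeps the target networks changing slowly, which is the usual motivation for the soft update.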
Findings
DDPG achieves reasonable control performance in simulations.
The approach handles high-dimensional, stochastic, and partially observable states.
Simulation results show computational feasibility of the method.
Abstract
The building sector is the world's largest energy consumer, and there has been considerable research interest in the energy consumption and comfort management of buildings. Inspired by recent advances in reinforcement learning (RL), this paper aims at assessing the potential of RL in building climate control problems with occupant interaction. We apply a recent RL approach, called DDPG (deep deterministic policy gradient), to continuous building control tasks and assess its performance with simulation studies in terms of its ability to handle (a) partial state observability due to sensor limitations; (b) a complex stochastic system with high-dimensional state spaces, which are jointly continuous and discrete; (c) uncertainties due to ambient weather conditions, occupants' behavior, and comfort feelings. Especially, the partial observability and uncertainty due to the occupant…
Taxonomy
Topics: Building Energy and Comfort Optimization · Smart Grid Energy Management · Energy Efficiency and Management
Methods: Adam · Weight Decay · Convolution · Dense Connections · Batch Normalization · Experience Replay · Deep Deterministic Policy Gradient
