Building HVAC Scheduling Using Reinforcement Learning via Neural Network   Based Model Approximation

Chi Zhang; Sanmukh R. Kuppannagari; Rajgopal Kannan; Viktor K.; Prasanna

arXiv:1910.05313·eess.SY·March 24, 2021

Building HVAC Scheduling Using Reinforcement Learning via Neural Network Based Model Approximation

Chi Zhang, Sanmukh R. Kuppannagari, Rajgopal Kannan, Viktor K., Prasanna

PDF

TL;DR

This paper introduces a model-based reinforcement learning method using neural network approximations and MPC for HVAC scheduling, achieving significant energy savings and improved training efficiency in building management.

Contribution

It proposes a neural network-based model predictive control approach for HVAC scheduling that enhances safety and reduces training time compared to model-free methods.

Findings

01

Achieves 17.1% to 21.8% energy savings.

02

Reduces training steps by 10x compared to model-free RL.

03

Trajectory deviation from ground truth is below 20%.

Abstract

Buildings sector is one of the major consumers of energy in the United States. The buildings HVAC (Heating, Ventilation, and Air Conditioning) systems, whose functionality is to maintain thermal comfort and indoor air quality (IAQ), account for almost half of the energy consumed by the buildings. Thus, intelligent scheduling of the building HVAC system has the potential for tremendous energy and cost savings while ensuring that the control objectives (thermal comfort, air quality) are satisfied. Recently, several works have focused on model-free deep reinforcement learning based techniques such as Deep Q-Network (DQN). Such methods require extensive interactions with the environment. Thus, they are impractical to implement in real systems due to low sample efficiency. Safety-aware exploration is another challenge in real systems since certain actions at particular states may result in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.