Building HVAC Scheduling Using Reinforcement Learning via Neural Network Based Model Approximation
Chi Zhang, Sanmukh R. Kuppannagari, Rajgopal Kannan, Viktor K., Prasanna

TL;DR
This paper introduces a model-based reinforcement learning method using neural network approximations and MPC for HVAC scheduling, achieving significant energy savings and improved training efficiency in building management.
Contribution
It proposes a neural network-based model predictive control approach for HVAC scheduling that enhances safety and reduces training time compared to model-free methods.
Findings
Achieves 17.1% to 21.8% energy savings.
Reduces training steps by 10x compared to model-free RL.
Trajectory deviation from ground truth is below 20%.
Abstract
Buildings sector is one of the major consumers of energy in the United States. The buildings HVAC (Heating, Ventilation, and Air Conditioning) systems, whose functionality is to maintain thermal comfort and indoor air quality (IAQ), account for almost half of the energy consumed by the buildings. Thus, intelligent scheduling of the building HVAC system has the potential for tremendous energy and cost savings while ensuring that the control objectives (thermal comfort, air quality) are satisfied. Recently, several works have focused on model-free deep reinforcement learning based techniques such as Deep Q-Network (DQN). Such methods require extensive interactions with the environment. Thus, they are impractical to implement in real systems due to low sample efficiency. Safety-aware exploration is another challenge in real systems since certain actions at particular states may result in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
