Experimental evaluation of offline reinforcement learning for HVAC control in buildings
Jun Wang, Linyan Li, Qi Liu, Yu Yang

TL;DR
This paper evaluates the effectiveness of offline reinforcement learning algorithms for HVAC control in buildings, demonstrating their potential to reduce temperature violations and save energy using historical data.
Contribution
It provides a comprehensive analysis of offline RL algorithms for HVAC control, highlighting their strengths, limitations, and the impact of dataset characteristics on performance.
Findings
Offline RL can effectively reduce temperature violations by up to 28.5%.
Power consumption can be decreased by up to 12.1% with offline RL controllers.
Datasets with certain suboptimality levels and smaller scale are suitable for training effective RL-based HVAC controllers.
Abstract
Reinforcement learning (RL) techniques have been increasingly investigated for dynamic HVAC control in buildings. However, most studies focus on exploring solutions in online or off-policy scenarios without discussing in detail the implementation feasibility or effectiveness of dealing with purely offline datasets or trajectories. The lack of these works limits the real-world deployment of RL-based HVAC controllers, especially considering the abundance of historical data. To this end, this paper comprehensively evaluates the strengths and limitations of state-of-the-art offline RL algorithms by conducting analytical and numerical studies. The analysis is conducted from two perspectives: algorithms and dataset characteristics. As a prerequisite, the necessity of applying offline RL algorithms is first confirmed in two building environments. The ability of observation history modeling to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBuilding Energy and Comfort Optimization
MethodsFocus
