A Safe and Data-efficient Model-based Reinforcement Learning System for HVAC Control
Xianzhong Ding, Zhiyu An, Arya Rathee, and Wan Du

TL;DR
This paper introduces CLUE, a data-efficient, uncertainty-aware model-based reinforcement learning system for HVAC control that significantly reduces training data needs while maintaining comfort and energy efficiency.
Contribution
The paper presents a novel MBRL system using Gaussian Processes with meta-kernel learning for efficient building dynamics modeling in HVAC control.
Findings
Reduces training data from hundreds of days to seven.
Decreases comfort violations by 12.07% on average.
Maintains robust control performance with less data.
Abstract
Model-Based Reinforcement Learning (MBRL) has been widely studied for Heating, Ventilation, and Air Conditioning (HVAC) control in buildings. One of the critical challenges is the large amount of data required to effectively train neural networks for modeling building dynamics. This paper presents CLUE, an MBRL system for HVAC control in buildings. CLUE optimizes HVAC operations by integrating a Gaussian Process (GP) model to model building dynamics with uncertainty awareness. CLUE utilizes GP to predict state transitions as Gaussian distributions, effectively capturing prediction uncertainty and enhancing decision-making under sparse data conditions. Our approach employs a meta-kernel learning technique to efficiently set GP kernel hyperparameters using domain knowledge from diverse buildings. This drastically reduces the data requirements typically associated with GP models in HVAC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Advanced Control Systems Optimization
MethodsSparse Evolutionary Training · Gaussian Process
