Go Beyond Black-box Policies: Rethinking the Design of Learning Agent for Interpretable and Verifiable HVAC Control
Zhiyu An, Xianzhong Ding, Wan Du

TL;DR
This paper introduces a decision tree-based, verifiable, and interpretable reinforcement learning approach for HVAC control that improves energy efficiency and occupant comfort while providing reliability guarantees.
Contribution
It presents a novel method for extracting verifiable decision tree policies from thermal models, addressing reliability and interpretability in HVAC RL control.
Findings
Achieves 68.4% energy savings over state-of-the-art methods.
Increases human comfort gain by 14.8%.
Reduces computation overhead by 1127 times.
Abstract
Recent research has shown the potential of Model-based Reinforcement Learning (MBRL) to enhance energy efficiency of Heating, Ventilation, and Air Conditioning (HVAC) systems. However, existing methods rely on black-box thermal dynamics models and stochastic optimizers, lacking reliability guarantees and posing risks to occupant health. In this work, we overcome the reliability bottleneck by redesigning HVAC controllers using decision trees extracted from existing thermal dynamics models and historical data. Our decision tree-based policies are deterministic, verifiable, interpretable, and more energy-efficient than current MBRL methods. First, we introduce a novel verification criterion for RL agents in HVAC control based on domain knowledge. Second, we develop a policy extraction procedure that produces a verifiable decision tree policy. We found that the high dimensionality of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Control Systems Optimization · Fuzzy Logic and Control Systems · Data Stream Mining Techniques
