Hierarchical Control Framework Integrating LLMs with RL for Decarbonized HVAC Operation
Dianyu Zhong, Tian Xing, Kailai Sun, Xu Yang, Heye Huang, Irfan Qaisar, Tinggang Jia, Shaobo Wang, Qianchuan Zhao

TL;DR
This paper introduces a hierarchical control framework combining LLMs and RL to improve energy efficiency and comfort in decarbonized multi-zone HVAC systems, addressing exploration and action space challenges.
Contribution
It presents a novel hierarchical approach where LLMs generate feasible action masks to enhance RL efficiency in HVAC control, validated with real-world building data.
Findings
Achieved 39.1% reduction in comfort deviation compared to DQN.
Reduced HVAC energy use to 140.90 kWh, outperforming vanilla RL baselines.
Demonstrated LLM-guided masking improves exploration and stability.
Abstract
Heating, ventilation, and air conditioning (HVAC) systems account for a substantial share of building energy consumption. Environmental uncertainty and dynamic occupancy behavior bring challenges in decarbonized HVAC control. Reinforcement learning (RL) can optimize long-horizon comfort-energy trade-offs but suffers from exponential action-space growth and inefficient exploration in multi-zone buildings. Large language models (LLMs) can encode semantic context and operational knowledge, yet when used alone they lack reliable closed-loop numerical optimization and may result in less reliable comfort-energy trade-offs. To address these limitations, we propose a hierarchical control framework in which a fine-tuned LLM, trained on historical building operation data, generates state-dependent feasible action masks that prune the combinatorial joint action space into operationally plausible…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
