Hierarchical Control Framework Integrating LLMs with RL for Decarbonized HVAC Operation

Dianyu Zhong; Tian Xing; Kailai Sun; Xu Yang; Heye Huang; Irfan Qaisar; Tinggang Jia; Shaobo Wang; Qianchuan Zhao

arXiv:2603.26050·eess.SY·March 30, 2026

Hierarchical Control Framework Integrating LLMs with RL for Decarbonized HVAC Operation

Dianyu Zhong, Tian Xing, Kailai Sun, Xu Yang, Heye Huang, Irfan Qaisar, Tinggang Jia, Shaobo Wang, Qianchuan Zhao

PDF

TL;DR

This paper introduces a hierarchical control framework combining LLMs and RL to improve energy efficiency and comfort in decarbonized multi-zone HVAC systems, addressing exploration and action space challenges.

Contribution

It presents a novel hierarchical approach where LLMs generate feasible action masks to enhance RL efficiency in HVAC control, validated with real-world building data.

Findings

01

Achieved 39.1% reduction in comfort deviation compared to DQN.

02

Reduced HVAC energy use to 140.90 kWh, outperforming vanilla RL baselines.

03

Demonstrated LLM-guided masking improves exploration and stability.

Abstract

Heating, ventilation, and air conditioning (HVAC) systems account for a substantial share of building energy consumption. Environmental uncertainty and dynamic occupancy behavior bring challenges in decarbonized HVAC control. Reinforcement learning (RL) can optimize long-horizon comfort-energy trade-offs but suffers from exponential action-space growth and inefficient exploration in multi-zone buildings. Large language models (LLMs) can encode semantic context and operational knowledge, yet when used alone they lack reliable closed-loop numerical optimization and may result in less reliable comfort-energy trade-offs. To address these limitations, we propose a hierarchical control framework in which a fine-tuned LLM, trained on historical building operation data, generates state-dependent feasible action masks that prune the combinatorial joint action space into operationally plausible…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.