Optimal Control of District Cooling Energy Plant with Reinforcement Learning and MPC
Zhong Guo, Aditya Chaudhari, Austin R. Coffman, Prabir Barooah

TL;DR
This paper compares reinforcement learning and model predictive control for optimizing district cooling energy plants, demonstrating both methods achieve significant cost savings with different computational approaches.
Contribution
It introduces a novel RL controller based on least-squares policy iteration and compares it with an MPC controller that avoids mixed-integer programming.
Findings
Both controllers achieve about 17% cost savings.
The RL controller uses a new Q-learning algorithm with effective design choices.
The MPC controller only requires a nonlinear program solver.
Abstract
We consider the problem of optimal control of district cooling energy plants (DCEPs) consisting of multiple chillers, a cooling tower, and a thermal energy storage (TES), in the presence of time-varying electricity price. A straightforward application of model predictive control (MPC) requires solving a challenging mixed-integer nonlinear program (MINLP) because of the on/off of chillers and the complexity of the DCEP model. Reinforcement learning (RL) is an attractive alternative since its real-time control computation is much simpler. But designing an RL controller is challenging due to myriad design choices and computationally intensive training. In this paper, we propose an RL controller and an MPC controller for minimizing the electricity cost of a DCEP, and compare them via simulations. The two controllers are designed to be comparable in terms of objective and information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Grid Energy Management · Advanced Control Systems Optimization
MethodsQ-Learning
