Input Convex Encoder-Only Transformer for Fast and Gradient-Stable MPC in Building Demand Response
Kaipeng Xu, Zhuo Zhi, Keyue Jiang

TL;DR
This paper introduces the Input-Convex Encoder-only Transformer (IC-EoT), a novel neural network architecture that enables fast, gradient-stable, and convex optimization-based Model Predictive Control for building demand response, significantly reducing solver times.
Contribution
The paper proposes the IC-EoT architecture, combining Transformer parallel processing with input convexity, to improve real-time MPC performance over existing recurrent convex neural networks.
Findings
IC-EoT is immune to gradient instability affecting recurrent ICNNs.
IC-EoT reduces MPC solver times by up to 8.3 times compared to IC-LSTM.
IC-EoT maintains predictive accuracy comparable to existing models.
Abstract
Learning-based Model Predictive Control (MPC) has emerged as a powerful strategy for building demand response. However, its practical deployment is often hindered by the non-convex optimization problems induced by standard neural network models. These problems lead to long solver times and suboptimal solutions, making real-time control over long horizons challenging. While Input Convex Neural Networks (ICNNs), such as Input-Convex Long Short-Term Memorys (IC-LSTMs), are developed to address the convexity issue, their recurrent architectures suffer from high computational cost and gradient instability as the prediction horizon increases. To overcome these limitations, this paper introduces the Input-Convex Encoder-only Transformer (IC-EoT), a novel architecture that synergizes the parallel processing capabilities of the Transformer with the guaranteed tractability of input convexity. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBuilding Energy and Comfort Optimization · Smart Grid Energy Management · Advanced Control Systems Optimization
