End-to-End Differentiable Predictive Control with Guaranteed Constraint Satisfaction and feasibility for Building Demand Response
Kaipeng Xu, Zhuo Zhi, Ruixuan Zhao, Keyue Jiang

TL;DR
This paper introduces an end-to-end differentiable predictive control framework using transformer models and online constraint tightening, achieving near-perfect constraint satisfaction in building demand response with theoretical guarantees.
Contribution
It proposes a novel E2E-DPC method that models complex building dynamics and guarantees constraint satisfaction without offline terminal set computation.
Findings
Over 99% reduction in constraint violations.
Maintains near-optimal electricity costs.
Validated in high-fidelity building simulations.
Abstract
The high energy consumption of buildings presents a critical need for advanced control strategies like Demand Response (DR). Differentiable Predictive Control (DPC) has emerged as a promising method for learning explicit control policies, yet conventional DPC frameworks are hindered by three key limitations: the use of simplistic dynamics models with limited expressiveness, a decoupled training paradigm that fails to optimize for closed-loop performance, and a lack of practical safety guarantees under realistic assumptions. To address these shortcomings, this paper proposes a novel End-to-End Differentiable Predictive Control (E2E-DPC) framework. Our approach utilizes an Encoder-Only Transformer to model the complex system dynamics and employs a unified, performance-oriented loss to jointly train the model and the control policy. Crucially, we introduce an online tube-based constraint…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Grid Energy Management · Advanced Control Systems Optimization · Building Energy and Comfort Optimization
