Data-driven control of room temperature and bidirectional EV charging using deep reinforcement learning: simulations and experiments
B. Svetozarevic, C.Baumann, S. Muntwiler, L. Di Natale, M. Zeilinger,, P. Heer

TL;DR
This paper introduces a data-driven deep reinforcement learning approach for joint control of room temperature and EV charging, achieving energy savings and comfort improvements in simulations and real building experiments.
Contribution
It presents a novel black-box pipeline using neural networks and DRL for building and EV control without complex physics models, validated through simulations and real-world deployment.
Findings
17% energy savings over heating season
19% better comfort satisfaction than standard controllers
30% energy savings in real building experiment
Abstract
This work presents a fully data-driven, black-box pipeline to obtain an optimal control policy for a multi-loop building control problem based on historical building and weather data, thus without the need for complex physics-based modelling. We demonstrate the method for joint control of room temperature and bidirectional EV charging to maximize the occupant thermal comfort and energy savings while leaving enough energy in the EV battery for the next trip. We modelled the room temperature with a recurrent neural network and EV charging with a piece-wise linear function. Using these models as a simulation environment, we applied a deep reinforcement learning (DRL) algorithm to obtain an optimal control policy. The learnt policy achieves on average 17% energy savings over the heating season and 19% better comfort satisfaction than a standard RB room temperature controller. When a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAdam · Weight Decay · Dense Connections · Experience Replay · Batch Normalization · Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Deep Deterministic Policy Gradient
