Active Reinforcement Learning for Robust Building Control
Doseok Jang, Larry Yan, Lucas Spangher, Costas Spanos

TL;DR
This paper introduces ActivePLR, a novel uncertainty-aware environment design algorithm for reinforcement learning that enhances building control by optimizing energy use and occupant comfort under normal and extreme weather conditions.
Contribution
The paper presents ActivePLR, a new UED algorithm that prioritizes performance in a specific environment while improving robustness, specifically applied to building control.
Findings
ActivePLR outperforms existing UED algorithms in energy efficiency.
ActivePLR maintains occupant comfort under extreme weather.
The approach effectively balances performance and robustness.
Abstract
Reinforcement learning (RL) is a powerful tool for optimal control that has found great success in Atari games, the game of Go, robotic control, and building optimization. RL is also very brittle; agents often overfit to their training environment and fail to generalize to new settings. Unsupervised environment design (UED) has been proposed as a solution to this problem, in which the agent trains in environments that have been specially selected to help it learn. Previous UED algorithms focus on trying to train an RL agent that generalizes across a large distribution of environments. This is not necessarily desirable when we wish to prioritize performance in one environment over others. In this work, we will be examining the setting of robust RL building control, where we wish to train an RL agent that prioritizes performing well in normal weather while still being robust to extreme…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBuilding Energy and Comfort Optimization · Smart Grid Energy Management · Smart Parking Systems Research
MethodsFocus · Balanced Selection
