Transfer Learning in Deep Learning Models for Building Load Forecasting: Case of Limited Data
Menna Nawar, Moustafa Shomer, Samy Faddel, and Huangjie Gong

TL;DR
This paper introduces a transfer learning framework using Transformer models to improve building load forecasting accuracy in scenarios with limited data, outperforming traditional deep learning models.
Contribution
It proposes a novel Building-to-Building Transfer Learning approach with Transformer models for scarce data scenarios, enhancing load forecasting performance.
Findings
Transfer learning improved accuracy by 56.8%.
Transformer outperformed LSTM and RNN models.
Root mean square error reduced to 0.009 with Transformer.
Abstract
Precise load forecasting in buildings could increase the bill savings potential and facilitate optimized strategies for power generation planning. With the rapid evolution of computer science, data-driven techniques, in particular the Deep Learning models, have become a promising solution for the load forecasting problem. These models have showed accurate forecasting results; however, they need abundance amount of historical data to maintain the performance. Considering the new buildings and buildings with low resolution measuring equipment, it is difficult to get enough historical data from them, leading to poor forecasting performance. In order to adapt Deep Learning models for buildings with limited and scarce data, this paper proposes a Building-to-Building Transfer Learning framework to overcome the problem and enhance the performance of Deep Learning models. The transfer learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnergy Load and Power Forecasting · Image and Signal Denoising Methods · Building Energy and Comfort Optimization
MethodsMulti-Head Attention · Attention Is All You Need · RMSProp · Stochastic Gradient Descent · building to building transfer learning · Dense Connections · Adam · Position-Wise Feed-Forward Layer · Softmax · Linear Layer
