Towards a Deeper Understanding of Transformer for Residential Non-intrusive Load Monitoring
Minhajur Rahman, Yasir Arafat

TL;DR
This paper investigates how various hyper-parameters affect transformer models in residential Non-Intrusive Load Monitoring, providing insights and guidelines to optimize their performance.
Contribution
It offers a comprehensive analysis of hyper-parameter impacts on transformer performance in NILM and proposes optimized settings that improve accuracy over existing models.
Findings
Optimal hyper-parameters identified for transformer models in NILM
Transformer with tuned hyper-parameters outperforms existing models
Guidelines for hyper-parameter selection in NILM transformers
Abstract
Transformer models have demonstrated impressive performance in Non-Intrusive Load Monitoring (NILM) applications in recent years. Despite their success, existing studies have not thoroughly examined the impact of various hyper-parameters on model performance, which is crucial for advancing high-performing transformer models. In this work, a comprehensive series of experiments have been conducted to analyze the influence of these hyper-parameters in the context of residential NILM. This study delves into the effects of the number of hidden dimensions in the attention layer, the number of attention layers, the number of attention heads, and the dropout ratio on transformer performance. Furthermore, the role of the masking ratio has explored in BERT-style transformer training, providing a detailed investigation into its impact on NILM tasks. Based on these experiments, the optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStructural Health Monitoring Techniques · Elevator Systems and Control · High voltage insulation and dielectric phenomena
MethodsSoftmax · Attention Is All You Need · Dropout
