Data-driven Approaches to Surrogate Machine Learning Model Development
H. Rhys Jones, Tingting Mu, Andrei C. Popescu, and Yusuf Sulehman

TL;DR
This paper applies data augmentation, custom loss functions, and transfer learning to improve surrogate machine learning models for engineering applications, achieving at least 38% performance gains.
Contribution
It demonstrates the effectiveness of combining data augmentation, custom loss functions, and transfer learning specifically for surrogate models in the nuclear industry.
Findings
Performance improved by at least 38% with combined techniques.
Transfer learning enhances surrogate model accuracy significantly.
Each technique independently contributes to model performance.
Abstract
We demonstrate the adaption of three established methods to the field of surrogate machine learning model development. These methods are data augmentation, custom loss functions and transfer learning. Each of these methods have seen widespread use in the field of machine learning, however, here we apply them specifically to surrogate machine learning model development. The machine learning model that forms the basis behind this work was intended to surrogate a traditional engineering model used in the UK nuclear industry. Previous performance of this model has been hampered by poor performance due to limited training data. Here, we demonstrate that through a combination of additional techniques, model performance can be significantly improved. We show that each of the aforementioned techniques have utility in their own right and in combination with one another. However, we see them best…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning
