Investigating Deep Learning Model Calibration for Classification Problems in Mechanics
Saeed Mohammadzadeh, Peerasait Prachaseree, Emma Lejeune

TL;DR
This paper investigates the calibration of deep learning models in engineering mechanics, evaluating methods like ensemble averaging and temperature scaling across diverse datasets to improve probability accuracy.
Contribution
It provides a comprehensive analysis of model calibration techniques in mechanics-related deep learning applications, highlighting ensemble averaging as particularly effective.
Findings
Ensemble averaging improves model calibration consistently.
Temperature scaling offers limited calibration benefits.
The study covers seven diverse mechanics datasets.
Abstract
Recently, there has been a growing interest in applying machine learning methods to problems in engineering mechanics. In particular, there has been significant interest in applying deep learning techniques to predicting the mechanical behavior of heterogeneous materials and structures. Researchers have shown that deep learning methods are able to effectively predict mechanical behavior with low error for systems ranging from engineered composites, to geometrically complex metamaterials, to heterogeneous biological tissue. However, there has been comparatively little attention paid to deep learning model calibration, i.e., the match between predicted probabilities of outcomes and the true probabilities of outcomes. In this work, we perform a comprehensive investigation into ML model calibration across seven open access engineering mechanics datasets that cover three distinct types of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Software Engineering Research · Machine Learning and Data Classification
MethodsHigh-Order Consensuses
