Uncertainty Estimation in Machine Learning
Valentin Arkov

TL;DR
This paper discusses the challenges of estimating uncertainty in machine learning models, especially complex and nonlinear ones, and explores the use of non-parametric techniques and high-performance computing to address these issues.
Contribution
It highlights the difficulties in uncertainty estimation for complex models and suggests leveraging modern supercomputing resources and non-parametric methods to improve evaluation.
Findings
Uncertainty estimation is challenging in nonlinear, complex models.
Pre-trained models like GPT-3 exemplify large-scale machine learning.
High-performance computing enables more effective uncertainty evaluation.
Abstract
Most machine learning techniques are based upon statistical learning theory, often simplified for the sake of computing speed. This paper is focused on the uncertainty aspect of mathematical modeling in machine learning. Regression analysis is chosen to further investigate the evaluation aspect of uncertainty in model coefficients and, more importantly, in the output feature value predictions. A survey demonstrates major stages in the conventional least squares approach to the creation of the regression model, along with its uncertainty estimation. On the other hand, it is shown that in machine learning the model complexity and severe nonlinearity become serious obstacles to uncertainty evaluation. Furthermore, the process of machine model training demands high computing power, not available at the level of personal computers. This is why so-called pre-trained models are widely used in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Processing Techniques · Computational Physics and Python Applications · Neural Networks and Applications
MethodsAttention Is All You Need · Linear Layer · Softmax · Layer Normalization · Multi-Head Attention · Byte Pair Encoding · Adam · Label Smoothing · Residual Connection · Dropout
