Why Machine Learning Cannot Ignore Maximum Likelihood Estimation

Mark J. van der Laan; Sherri Rose

arXiv:2110.12112·math.ST·October 26, 2021

Why Machine Learning Cannot Ignore Maximum Likelihood Estimation

Mark J. van der Laan, Sherri Rose

PDF

Open Access

TL;DR

This paper emphasizes the importance of integrating maximum likelihood estimation into machine learning to ensure statistical rigor and improve the estimation of functional parameters like prediction functions and densities.

Contribution

It highlights the necessity of incorporating maximum likelihood methods into machine learning for better theoretical foundations and practical impact.

Findings

01

Maximum likelihood is essential for statistical inference in machine learning.

02

Integrating MLE improves the estimation of prediction functions.

03

The paper advocates for a foundational shift towards MLE in ML research.

Abstract

The growth of machine learning as a field has been accelerating with increasing interest and publications across fields, including statistics, but predominantly in computer science. How can we parse this vast literature for developments that exemplify the necessary rigor? How many of these manuscripts incorporate foundational theory to allow for statistical inference? Which advances have the greatest potential for impact in practice? One could posit many answers to these queries. Here, we assert that one essential idea is for machine learning to integrate maximum likelihood for estimation of functional parameters, such as prediction functions and conditional densities.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification