Sometimes nonparametrics beat parametrics, even when the model is right
Morten Byholt, Nils Lid Hjort

TL;DR
This paper demonstrates that nonparametric density estimators, like kernel density estimators, can outperform parametric methods such as maximum likelihood estimation in small samples, challenging conventional assumptions about model correctness.
Contribution
It provides a counter-intuitive example showing nonparametric methods can outperform parametric ones even when the parametric model is correct, especially in small sample scenarios.
Findings
Kernel density estimators can outperform parametric estimators in small samples.
Nonparametric methods are not always inferior to parametric methods when models are correct.
Small-sample analysis reveals surprising performance advantages of nonparametric estimators.
Abstract
A basic issue in both teaching of and practice of statistics is the interplay between modelling assumptions and inference performance. The general message conveyed is that stronger assumptions lead to better statistical performance of the relevant estimators, tests and confidence intervals, provided that these assumptions hold. On the other hand, fewer assumptions often lead to safer and more robust methods that are good also outside narrow conditions, but not quite as good as specialist methods that exploit such narrower conditions, if these are fulfilled. This interplay is nicely illustrated in the context of density estimation, where parametric and nonparametric methods can be contrasted. The parametric ones have mean squared errors of size in terms of sample size if the parametric model is right, but are not even consistent outside the model. The nonparametric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Control Systems and Identification
