Kernel Density Estimation with Berkson Error

James P. Long; Noureddine El Karoui; John A. Rice

arXiv:1401.3362·stat.ME·July 30, 2014·1 cites

Kernel Density Estimation with Berkson Error

James P. Long, Noureddine El Karoui, John A. Rice

PDF

Open Access

TL;DR

This paper develops kernel density estimators for the convolution of a true density with known Berkson error, compares bandwidth selection methods, and introduces a data-driven bandwidth estimator with applications in epidemiology.

Contribution

It introduces a new approach for bandwidth selection in Berkson error density estimation and analyzes its performance both asymptotically and through simulations.

Findings

01

Optimal bandwidth depends on the error structure.

02

Smoothing is crucial when the error density is concentrated near zero.

03

The proposed data-driven estimator performs well on NO₂ exposure data.

Abstract

Given a sample ${X_{i}}_{i = 1}^{n}$ from $f_{X}$ , we construct kernel density estimators for $f_{Y}$ , the convolution of $f_{X}$ with a known error density $f_{ϵ}$ . This problem is known as density estimation with Berkson error and has applications in epidemiology and astronomy. Little is understood about bandwidth selection for Berkson density estimation. We compare three approaches to selecting the bandwidth both asymptotically, using large sample approximations to the MISE, and at finite samples, using simulations. Our results highlight the relationship between the structure of the error $f_{ϵ}$ and the optimal bandwidth. In particular, the results demonstrate the importance of smoothing when the error term $f_{ϵ}$ is concentrated near 0. We propose a data--driven bandwidth estimator and test its performance on NO $_{2}$ exposure data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Bayesian Methods and Mixture Models