Kernel Density Estimation with Linked Boundary Conditions
Matthew J. Colbrook, Zdravko I. Botev, Karsten Kuritz, Shev, MacNamara

TL;DR
This paper introduces a novel kernel density estimator that effectively handles linked boundary conditions on a finite interval, reducing boundary bias and improving approximation accuracy, with applications in cancer research and biology.
Contribution
The paper develops a new KDE incorporating linked boundary conditions using a non-self-adjoint diffusion process and the Fokas method, advancing boundary bias correction techniques.
Findings
The new KDE achieves consistency and negligible boundary bias.
It demonstrates increased approximation rates compared to existing methods.
Numerical experiments show the method is fast and accurate.
Abstract
Kernel density estimation on a finite interval poses an outstanding challenge because of the well-recognized bias at the boundaries of the interval. Motivated by an application in cancer research, we consider a boundary constraint linking the values of the unknown target density function at the boundaries. We provide a kernel density estimator (KDE) that successfully incorporates this linked boundary condition, leading to a non-self-adjoint diffusion process and expansions in non-separable generalized eigenfunctions. The solution is rigorously analyzed through an integral representation given by the unified transform (or Fokas method). The new KDE possesses many desirable properties, such as consistency, asymptotically negligible bias at the boundaries, and an increased rate of approximation, as measured by the AMISE. We apply our method to the motivating example in biology and provide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
