Density estimation with atoms, and functional estimation for mixed discrete-continuous data
Aytijhya Saha, Aaditya Ramdas

TL;DR
This paper introduces a simple modification to existing density estimation methods that enables consistent estimation in mixed discrete-continuous data scenarios without additional assumptions, improving robustness and practical performance.
Contribution
A straightforward wrapper approach that adapts existing density estimators to handle mixed data, removing the need for knowledge of discrete support or mixing proportions.
Findings
Method achieves consistency in mixed data scenarios.
Improved empirical performance over traditional methods.
No additional assumptions required for the modified estimators.
Abstract
In classical density (or density-functional) estimation, it is standard to assume that the underlying distribution has a density with respect to the Lebesgue measure. However, when the data distribution is a mixture of continuous and discrete components, the resulting methods are inconsistent in theory and perform poorly in practice. In this paper, we point out that a minor modification of existing methods for nonparametric density (functional) estimation can allow us to fully remove this assumption while retaining nearly identical theoretical guarantees and improved empirical performance. Our approach is very simple: data points that appear exactly once are likely to originate from the continuous component, whereas repeated observations are indicative of the discrete part. Leveraging this observation, we modify existing estimators for a broad class of functionals of the continuous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Inference · Markov Chains and Monte Carlo Methods
