Two-component Mixture Model in the Presence of Covariates
Nabarun Deb, Sujayam Saha, Adityanand Guntuboyina, Bodhisattva Sen

TL;DR
This paper introduces a flexible, nonparametric approach for two-component mixture models with covariates, providing estimators with near-parametric convergence rates and demonstrating effectiveness through simulations and real data applications.
Contribution
It proposes a tuning parameter-free nonparametric maximum likelihood method for mixture models with covariates, along with scalable marginal methods and theoretical convergence guarantees.
Findings
Estimators achieve near-parametric convergence rates.
Methods perform well in neuroscience and astronomy data.
The R package NPMLEmix implements all proposed procedures.
Abstract
In this paper, we study a generalization of the two-groups model in the presence of covariates --- a problem that has recently received much attention in the statistical literature due to its applicability in multiple hypotheses testing problems. The model we consider allows for infinite dimensional parameters and offers flexibility in modeling the dependence of the response on the covariates. We discuss the identifiability issues arising in this model and systematically study several estimation strategies. We propose a tuning parameter-free nonparametric maximum likelihood method, implementable via the EM algorithm, to estimate the unknown parameters. Further, we derive the rate of convergence of the proposed estimators --- in particular, we show that the finite sample Hellinger risk for every `approximate' nonparametric maximum likelihood estimator achieves a near-parametric rate (up…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
