Model-free Estimation of Latent Structure via Multiscale Nonparametric Maximum Likelihood
Bryon Aragam, Ruiyi Yang

TL;DR
This paper introduces a model-free, multiscale nonparametric maximum likelihood approach to estimate and interpret latent structures in multivariate data, enabling effective clustering without prior assumptions.
Contribution
It develops a novel multiscale density representation and model selection method for uncovering latent structures without assuming their existence beforehand.
Findings
Effective in capturing diverse latent structures
Provides asymptotic characterization of estimators
Demonstrates improved clustering performance
Abstract
Multivariate distributions often carry latent structures that are difficult to identify and estimate, and which better reflect the data generating mechanism than extrinsic structures exhibited simply by the raw data. In this paper, we propose a model-free approach for estimating such latent structures whenever they are present, without assuming they exist a priori. Given an arbitrary density , we construct a multiscale representation of the density and propose data-driven methods for selecting representative models that capture meaningful discrete structure. Our approach uses a nonparametric maximum likelihood estimator to estimate the latent structure at different scales and we further characterize their asymptotic limits. By carrying out such a multiscale analysis, we obtain coarseto-fine structures inherent in the original distribution, which are integrated via a model selection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification
