Distributional Random Forests for Complex Survey Designs on Reproducing Kernel Hilbert Spaces
Yating Zou, Marcos Matabuena, Michael R. Kosorok

TL;DR
This paper introduces a survey-calibrated distributional random forest method for estimating conditional distributions under complex survey designs, providing theoretical guarantees and practical applications in health data analysis.
Contribution
It develops a novel distributional random forest framework that accounts for survey design features, with proven consistency and applicability to real-world complex survey data.
Findings
Finite sample performance demonstrated through simulations.
First model-free estimation of conditional distributions under survey designs.
Application to NHANES data for subgroup risk profiling.
Abstract
We study estimation of the conditional law and continuous functionals when takes values in a locally compact Polish space, , and the observations arise from a complex survey design. We propose a survey-calibrated distributional random forest (SDRF) that incorporates complex-design features via a pseudo-population bootstrap, PSU-level honesty, and a Maximum Mean Discrepancy (MMD) split criterion computed from kernel mean embeddings of H\'{a}jek-type (design-weighted) node distributions. We provide a framework for analyzing forest-style estimators under survey designs; establish design consistency for the finite-population target and model consistency for the super-population target under explicit conditions on the design, kernel, resampling multipliers, and tree partitions. As far as we are aware, these are the first results on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Bayesian Methods and Mixture Models
