Stable Graphical Model Estimation with Random Forests for Discrete, Continuous, and Mixed Variables
Bernd Fellinghauer, Peter B\"uhlmann, Martin Ryffel, Michael von Rhein, and Jan D. Reinhardt

TL;DR
This paper introduces Graphical Random Forests (GRaFo), a new method for estimating conditional independence graphs in mixed-type data, with stability selection for error control, and demonstrates its effectiveness on simulated and real-world health data.
Contribution
GRaFo is a novel approach that effectively estimates graphical models for mixed data types, incorporating stability selection for controlling false positives.
Findings
GRaFo performs well with mixed data types.
Stability selection effectively controls false positive edges.
Application to health data reveals meaningful variable connections.
Abstract
A conditional independence graph is a concise representation of pairwise conditional independence among many variables. Graphical Random Forests (GRaFo) are a novel method for estimating pairwise conditional independence relationships among mixed-type, i.e. continuous and discrete, variables. The number of edges is a tuning parameter in any graphical model estimator and there is no obvious number that constitutes a good choice. Stability Selection helps choosing this parameter with respect to a bound on the expected number of false positives (error control). The performance of GRaFo is evaluated and compared with various other methods for p = 50, 100, and 200 possibly mixed-type variables while sample size is n = 100 (n = 500 for maximum likelihood). Furthermore, GRaFo is applied to data from the Swiss Health Survey in order to evaluate how well it can reproduce the interconnection of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
