High-Dimensional Undirected Graphical Models for Arbitrary Mixed Data
Konstantin G\"obler, Anne Miloschewski, Mathias Drton, Sach, Mukherjee

TL;DR
This paper introduces a scalable method for learning graphical models from complex datasets with mixed variable types by leveraging latent Gaussian copula models and classical correlation ideas, enabling joint analysis of diverse data.
Contribution
It proposes a novel, flexible approach for modeling mixed data types using latent Gaussian copulas, extending existing methods to more general variable combinations.
Findings
Method performs well in simulations.
Successfully applied to UK Biobank COVID-19 data.
Provides theoretical and empirical validation.
Abstract
Graphical models are an important tool in exploring relationships between variables in complex, multivariate data. Methods for learning such graphical models are well developed in the case where all variables are either continuous or discrete, including in high-dimensions. However, in many applications data span variables of different types (e.g. continuous, count, binary, ordinal, etc.), whose principled joint analysis is nontrivial. Latent Gaussian copula models, in which all variables are modeled as transformations of underlying jointly Gaussian variables, represent a useful approach. Recent advances have shown how the binary-continuous case can be tackled, but the general mixed variable type regime remains challenging. In this work, we make the simple yet useful observation that classical ideas concerning polychoric and polyserial correlations can be leveraged in a latent Gaussian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Time Series Analysis and Forecasting
