A Family of Blockwise One-Factor Distributions for Modelling High-Dimensional Binary Data
Matthieu Marbac, Mohammed Sedki

TL;DR
This paper introduces a new family of blockwise one-factor distributions for high-dimensional binary data, providing explicit probabilities, easy interpretation, and efficient parameter estimation, with demonstrated effectiveness on real and simulated data.
Contribution
The paper presents a novel family of one-factor distributions with explicit probabilities and a blockwise extension, improving interpretability and computational efficiency for high-dimensional binary data modeling.
Findings
Explicit probability formulas avoid numeric approximations
Model interpretation is simplified with continuous and binary parameters
Effective parameter estimation via EM algorithm and model selection method
Abstract
We introduce a new family of one factor distributions for high-dimensional binary data. The model provides an explicit probability for each event, thus avoiding the numeric approximations often made by existing methods. Model interpretation is easy since each variable is described by two continuous parameters (corresponding to its marginal probability and to its strength of dependency with the other variables) and by one binary parameter (defining if the dependencies are positive or negative). An extension of this new model is proposed by assuming that the variables are split into independent blocks which follow the new one factor distribution. Parameter estimation is performed by the inference margin procedure where the second step is achieved by an expectation-maximization algorithm. Model selection is carried out by a deterministic approach which strongly reduces the number of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Bayesian Methods and Mixture Models · Statistical Methods and Inference
