# High-dimensional structure learning of binary pairwise Markov networks: A comparative numerical study

**Authors:** Johan Pensar, Yingying Xu, Santeri Puranen, Maiju Pesonen, Yoshiyuki Kabashima, Jukka Corander

arXiv: 1901.04345 · 2019-07-15

## TL;DR

This paper compares pseudo-likelihood and pairwise-test methods for learning the structure of high-dimensional binary pairwise Markov networks. It finds that pairwise tests can be more accurate than pseudo-likelihood approaches in settings often encountered in high-dimensional structure learning applications.

## Contribution

It provides a comprehensive numerical comparison of pseudo-likelihood and pairwise test methods for high-dimensional binary Markov network structure learning.

## Key findings

- Pairwise methods can be more accurate than pseudo-likelihood methods in high-dimensional settings.
- A parallel Gibbs sampler based on restricted Boltzmann machines is proposed for efficient sampling.
- The comparison is carried out on synthetic data generated by binary pairwise Markov networks.
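
To make the sampling setting concrete, here is a minimal sketch of a plain single-site Gibbs sampler for a binary pairwise Markov network. It is not the paper's parallel, restricted-Boltzmann-machine-based sampler, whose construction is not described in this summary. The function name `gibbs_sample_ising`, the {-1, +1} coding of the variables, and the parameters `J` (symmetric couplings), `h` (fields), `burn_in`, and `thin` are assumptions made for illustration.

```python
import numpy as np

def gibbs_sample_ising(J, h, n_samples, burn_in=200, thin=5, seed=0):
    """Single-site Gibbs sampler (illustrative, not the paper's sampler).

    Target: p(x) proportional to exp(sum_i h_i x_i + sum_{i<j} J_ij x_i x_j),
    with x_i in {-1, +1} and J symmetric.
    """
    rng = np.random.default_rng(seed)
    J = np.asarray(J, dtype=float)
    h = np.asarray(h, dtype=float)
    d = len(h)
    x = rng.choice(np.array([-1, 1]), size=d)
    samples = np.empty((n_samples, d), dtype=int)
    kept, sweep = 0, 0
    while kept < n_samples:
        for i in range(d):
            # Local field from the neighbours of node i (self-term excluded).
            field = h[i] + J[i] @ x - J[i, i] * x[i]
            # P(x_i = +1 | all other variables) for the +/-1 coding.
            p_plus = 1.0 / (1.0 + np.exp(-2.0 * field))
            x[i] = 1 if rng.random() < p_plus else -1
        sweep += 1
        # Keep every `thin`-th sweep after the burn-in period.
        if sweep > burn_in and (sweep - burn_in) % thin == 0:
            samples[kept] = x
            kept += 1
    return samples

# Usage: a 5-node chain graph with uniform coupling 0.5 and no external field.
d = 5
J = np.zeros((d, d))
for i in range(d - 1):
    J[i, i + 1] = J[i + 1, i] = 0.5
samples = gibbs_sample_ising(J, np.zeros(d), n_samples=100)
```

Each sweep updates the nodes one at a time, so its cost grows with the number of nodes and it cannot be parallelized as written. That sequential bottleneck is the kind of limitation a parallelizable sampler is meant to remove.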

## Abstract

Learning the undirected graph structure of a Markov network from data is a problem that has received a lot of attention during the last few decades. As a result of the general applicability of the model class, a myriad of methods have been developed in parallel in several research fields. Recently, as the size of the considered systems has increased, the focus of new methods has been shifted towards the high-dimensional domain. In particular, introduction of the pseudo-likelihood function has pushed the limits of score-based methods which were originally based on the likelihood function. At the same time, methods based on simple pairwise tests have been developed to meet the challenges arising from increasingly large data sets in computational biology. Apart from being applicable to high-dimensional problems, methods based on the pseudo-likelihood and pairwise tests are fundamentally very different. To compare the accuracy of the different types of methods, an extensive numerical study is performed on data generated by binary pairwise Markov networks. A parallelizable Gibbs sampler, based on restricted Boltzmann machines, is proposed as a tool to efficiently sample from sparse high-dimensional networks. The results of the study show that pairwise methods can be more accurate than pseudo-likelihood methods in settings often encountered in high-dimensional structure learning applications.
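
As a rough illustration of what a pairwise method looks like, the sketch below scores every pair of binary variables by empirical mutual information and keeps the pairs above a threshold. This is a generic stand-in, not one of the specific tests compared in the paper, which this summary does not name. The function names `pairwise_mutual_information` and `select_edges`, the 0/1 coding of the data, and the threshold value are assumptions.

```python
import numpy as np

def pairwise_mutual_information(X):
    """Empirical mutual information (in nats) between all column pairs
    of a binary 0/1 data matrix X with shape (n_samples, n_variables)."""
    X = np.asarray(X, dtype=float)
    n, d = X.shape
    p1 = X.mean(axis=0)                      # P(x_i = 1)
    marginal = {1: p1, 0: 1.0 - p1}
    value = {1: X, 0: 1.0 - X}               # indicator matrices for each state
    mi = np.zeros((d, d))
    for a in (0, 1):
        for b in (0, 1):
            joint = value[a].T @ value[b] / n        # P(x_i = a, x_j = b)
            indep = np.outer(marginal[a], marginal[b])
            mask = joint > 0                          # 0 * log(0) is treated as 0
            mi[mask] += joint[mask] * np.log(joint[mask] / indep[mask])
    np.fill_diagonal(mi, 0.0)
    return mi

def select_edges(mi, threshold):
    """Return pairs (i, j), i < j, whose score exceeds the threshold."""
    d = mi.shape[0]
    return [(i, j) for i in range(d) for j in range(i + 1, d)
            if mi[i, j] > threshold]

# Usage: columns 0 and 1 are identical, column 2 is independent of both.
X = np.array([[0, 0, 0],
              [0, 0, 1],
              [1, 1, 0],
              [1, 1, 1]])
mi = pairwise_mutual_information(X)
edges = select_edges(mi, threshold=0.1)
```

Each score depends on only two variables at a time, which is what keeps this family of methods cheap for large systems. A pseudo-likelihood method instead conditions each variable on all the others.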

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1901.04345/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1901.04345/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/1901.04345/full.md

---
Source: https://tomesphere.com/paper/1901.04345