# Un mod\`ele Bay\'esien de co-clustering de donn\'ees mixtes

**Authors:** Aichetou Bouchareb (SAMM), Marc Boull\'e, Fabrice Rossi (SAMM),, Fabrice Cl\'erot

arXiv: 1902.02056 · 2019-02-07

## TL;DR

This paper introduces a parameter-free Bayesian co-clustering method for mixed-type data tables, optimizing model quality through a Bayesian criterion and continuous optimization, demonstrated on real datasets.

## Contribution

It presents a novel MAP Bayesian approach for co-clustering mixed data types that is parameter-free and uses an exact Bayesian criterion for model evaluation.

## Key findings

- Effective co-clustering of mixed data types demonstrated on real datasets
- Parameter-free approach simplifies user interaction and model selection
- Bayesian criterion provides an exact measure of model quality

## Abstract

We propose a MAP Bayesian approach to perform and evaluate a co-clustering of mixed-type data tables. The proposed model infers an optimal segmentation of all variables then performs a co-clustering by minimizing a Bayesian model selection cost function. One advantage of this approach is that it is user parameter-free. Another main advantage is the proposed criterion which gives an exact measure of the model quality, measured by probability of fitting it to the data. Continuous optimization of this criterion ensures finding better and better models while avoiding data over-fitting. The experiments conducted on real data show the interest of this co-clustering approach in exploratory data analysis of large data sets.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.02056/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1902.02056/full.md

## References

9 references — full list in the complete paper: https://tomesphere.com/paper/1902.02056/full.md

---
Source: https://tomesphere.com/paper/1902.02056