Parameter Estimation for Grouped Data Using EM and MCEM Algorithms
Zahra A. Shirazi, Jo\~ao Pedro A. R. da Silva, Camila P. E. de, Souza

TL;DR
This paper develops EM and MCEM algorithms to estimate mean and variance from grouped data, addressing confidentiality concerns by working with approximate data instead of exact measurements.
Contribution
It introduces EM and MCEM methods tailored for grouped data under multivariate normal assumptions, providing a novel approach for privacy-preserving data analysis.
Findings
Algorithms perform well in simulation studies
Effective estimation demonstrated on Galton data set
Applicable to univariate, bivariate, and multivariate cases
Abstract
Nowadays, the confidentiality of data and information is of great importance for many companies and organizations. For this reason, they may prefer not to release exact data, but instead to grant researchers access to approximate data. For example, rather than providing the exact measurements of their clients, they may only provide researchers with grouped data, that is, the number of clients falling in each of a set of non-overlapping measurement intervals. The challenge is to estimate the mean and variance structure of the hidden ungrouped data based on the observed grouped data. To tackle this problem, this work considers the exact observed data likelihood and applies the Expectation-Maximization (EM) and Monte-Carlo EM (MCEM) algorithms for cases where the hidden data follow a univariate, bivariate, or multivariate normal distribution. Simulation studies are conducted to evaluate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Advanced Statistical Process Monitoring · Bayesian Methods and Mixture Models
