Learning Mixtures of Bernoulli Templates by Two-Round EM with Performance Guarantee
Adrian Barbu, Tianfu Wu, Ying Nian Wu

TL;DR
This paper extends the two-round EM algorithm's theoretical guarantees from Gaussian mixtures to high-dimensional Bernoulli templates, enabling effective learning of binary feature-based image models with high probability.
Contribution
It generalizes the two-round EM algorithm's performance guarantees to Bernoulli templates, broadening its applicability to binary feature models in high-dimensional settings.
Findings
Two-round EM learns Bernoulli mixtures with high accuracy.
Performance depends on template separation and feature count.
Validated with synthetic and real data examples.
Abstract
Dasgupta and Shulman showed that a two-round variant of the EM algorithm can learn mixture of Gaussian distributions with near optimal precision with high probability if the Gaussian distributions are well separated and if the dimension is sufficiently high. In this paper, we generalize their theory to learning mixture of high-dimensional Bernoulli templates. Each template is a binary vector, and a template generates examples by randomly switching its binary components independently with a certain probability. In computer vision applications, a binary vector is a feature map of an image, where each binary component indicates whether a local feature or structure is present or absent within a certain cell of the image domain. A Bernoulli template can be considered as a statistical model for images of objects (or parts of objects) from the same category. We show that the two-round EM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
