Learning Shared Kernel Models: the Shared Kernel EM algorithm
Graham W. Pulford

TL;DR
This paper introduces the shared kernel EM (SKEM) algorithm, a rigorous derivation of a shared kernel mixture model, applied to digit recognition, with comparisons of variants and source code provided.
Contribution
It provides a theoretically rigorous derivation of the shared kernel EM algorithm for Gaussian mixture models, addressing previous shortcomings and applying it to digit recognition.
Findings
SKEM achieves competitive accuracy in digit recognition.
Different feature and mixture component configurations impact performance.
Simplified classifiers decompose joint PDFs for efficiency.
Abstract
Expectation maximisation (EM) is an unsupervised learning method for estimating the parameters of a finite mixture distribution. It works by introducing "hidden" or "latent" variables via Baum's auxiliary function that allow the joint data likelihood to be expressed as a product of simple factors. The relevance of EM has increased since the introduction of the variational lower bound (VLB): the VLB differs from Baum's auxiliary function only by the entropy of the PDF of the latent variables . We first present a rederivation of the standard EM algorithm using data association ideas from the field of multiple target tracking, using -valued scalar data association hypotheses rather than the usual binary indicator vectors. The same method is then applied to a little known but much more general type of supervised EM algorithm for shared kernel models, related to probabilistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Bayesian Methods and Mixture Models · Face and Expression Recognition
