Improvements to Supervised EM Learning of Shared Kernel Models by Feature Space Partitioning
Graham W. Pulford

TL;DR
This paper enhances supervised EM learning for shared kernel models by rigorously deriving the algorithm and reducing computational complexity through feature space partitioning, enabling efficient training on higher-dimensional data.
Contribution
It provides a detailed EM derivation for the Gaussian shared kernel model and introduces feature space partitioning to improve efficiency and scalability.
Findings
Partitioned SKEM achieves lower complexity
Improved performance on MNIST dataset
Comparable or better accuracy than standard classifiers
Abstract
Expectation maximisation (EM) is usually thought of as an unsupervised learning method for estimating the parameters of a mixture distribution, however it can also be used for supervised learning when class labels are available. As such, EM has been applied to train neural nets including the probabilistic radial basis function (PRBF) network or shared kernel (SK) model. This paper addresses two major shortcomings of previous work in this area: the lack of rigour in the derivation of the EM training algorithm; and the computational complexity of the technique, which has limited it to low dimensional data sets. We first present a detailed derivation of EM for the Gaussian shared kernel model PRBF classifier, making use of data association theory to obtain the complete data likelihood, Baum's auxiliary function (the E-step) and its subsequent maximisation (M-step). To reduce complexity of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Neural Networks and Applications · Bayesian Methods and Mixture Models
