SSLfmm: An R Package for Semi-Supervised Learning with a Mixed-Missingness Mechanism in Finite Mixture Models
Geoffrey J. McLachlan, Jinran Wu

TL;DR
SSLfmm is an R package that models semi-supervised learning with mixed missingness mechanisms, leveraging label missingness information to improve classification in finite mixture models with Gaussian components.
Contribution
It introduces a novel approach to semi-supervised learning that accounts for both MCAR and MAR missingness, enhancing classifier estimation in finite mixture models.
Findings
Potentially lower misclassification rates than supervised methods
Effective modeling of mixed missingness improves estimation
Demonstrated through simulated examples
Abstract
Semi-supervised learning (SSL) constructs classifiers from datasets in which only a subset of observations is labelled, a situation that naturally arises because obtaining labels often requires expert judgement or costly manual effort. This motivates methods that integrate labelled and unlabelled data within a learning framework. Most SSL approaches assume that label absence is harmless, typically treated as missing completely at random or ignored, but in practice, the missingness process can be informative, as the chances of an observation being unlabelled may depend on the ambiguity of its feature vector. In such cases, the missingness indicators themselves provide additional information that, if properly modelled, may improve estimation efficiency. The \textbf{SSLfmm} package for R is designed to capture this behaviour by estimating the Bayes' classifier under a finite mixture model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Machine Learning and Data Classification · Statistical Methods and Inference
