Semi-supervised Learning with Deterministic Labeling and Large Margin Projection
Ji Xu, Gang Ren, Yao Xiao, Shaobo Li, Guoyin Wang

TL;DR
This paper introduces a novel semi-supervised learning method that constructs an optimal leading forest for representative sample selection and large margin metric learning, significantly improving stability and accuracy over existing graph-based SSL methods.
Contribution
It proposes a new sample selection strategy using an optimal leading forest and a kernelized large margin metric for improved semi-supervised learning performance.
Findings
Enhanced stability and accuracy compared to state-of-the-art methods
Effective handling of multi-modal and large class problems
Achieved encouraging accuracy and efficiency in experiments
Abstract
The centrality and diversity of the labeled data are very influential to the performance of semi-supervised learning (SSL), but most SSL models select the labeled data randomly. This study first construct a leading forest that forms a partially ordered topological space in an unsupervised way, and select a group of most representative samples to label with one shot (differs from active learning essentially) using property of homeomorphism. Then a kernelized large margin metric is efficiently learned for the selected data to classify the remaining unlabeled sample. Optimal leading forest (OLF) has been observed to have the advantage of revealing the difference evolution along a path within a subtree. Therefore, we formulate an optimization problem based on OLF to select the samples. Also with OLF, the multiple local metrics learning is facilitated to address multi-modal and mix-modal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Advanced Computing and Algorithms · Text and Document Classification Technologies
