Semi-Supervised Active Learning for Support Vector Machines: A Novel Approach that Exploits Structure Information in Data
Tobias Reitmaier, Adrian Calma, Bernhard Sick

TL;DR
This paper introduces a novel semi-supervised active learning approach for SVMs that leverages data structure information through probabilistic models, improving classification performance on benchmark datasets.
Contribution
It combines active learning with semi-supervised SVMs using probabilistic models and a new kernel, enhancing data utilization and classification accuracy.
Findings
Significantly better results than state-of-the-art methods on benchmark datasets.
Effective use of structure information improves learning efficiency.
Fusion of generative and discriminative models enhances SVM performance.
Abstract
In our today's information society more and more data emerges, e.g.~in social networks, technical applications, or business applications. Companies try to commercialize these data using data mining or machine learning methods. For this purpose, the data are categorized or classified, but often at high (monetary or temporal) costs. An effective approach to reduce these costs is to apply any kind of active learning (AL) methods, as AL controls the training process of a classifier by specific querying individual data points (samples), which are then labeled (e.g., provided with class memberships) by a domain expert. However, an analysis of current AL research shows that AL still has some shortcomings. In particular, the structure information given by the spatial pattern of the (un)labeled data in the input space of a classification model (e.g.,~cluster information), is used in an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Text and Document Classification Technologies
MethodsSupport Vector Machine
