Angular Regularization for Positive-Unlabeled Learning on the Hypersphere
Vasileios Sevetlidis, George Pavlidis, Antonios Gasteratos

TL;DR
AngularPU introduces a hypersphere-based positive-unlabeled learning framework that employs cosine similarity and angular regularization, achieving improved performance and interpretability in high-dimensional, scarce-positive scenarios.
Contribution
It proposes a novel PU learning method on the hypersphere using angular margin and regularization, with theoretical guarantees and superior empirical results.
Findings
Achieves competitive or superior performance on benchmark datasets.
Effective in high-dimensional and scarce-positive settings.
Provides geometric interpretability and scalability.
Abstract
Positive-Unlabeled (PU) learning addresses classification problems where only a subset of positive examples is labeled and the remaining data is unlabeled, making explicit negative supervision unavailable. Existing PU methods often rely on negative-risk estimation or pseudo-labeling, which either require strong distributional assumptions or can collapse in high-dimensional settings. We propose AngularPU, a novel PU framework that operates on the unit hypersphere using cosine similarity and angular margin. In our formulation, the positive class is represented by a learnable prototype vector, and classification reduces to thresholding the cosine similarity between an embedding and this prototype-eliminating the need for explicit negative modeling. To counteract the tendency of unlabeled embeddings to cluster near the positive prototype, we introduce an angular regularizer that encourages…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Imbalanced Data Classification Techniques
