An Effective Flow-based Method for Positive-Unlabeled Learning: 2-HNC
Dorit Hochbaum, Torpong Nitayanont

TL;DR
This paper introduces 2-HNC, a flow-based method for positive-unlabeled learning that ranks unlabeled samples by their likelihood of being negative using nested partitions derived from Hochbaum's Normalized Cut, improving classification accuracy.
Contribution
The paper presents a novel flow-based approach, 2-HNC, which leverages nested partitions from Hochbaum's Normalized Cut to effectively rank and classify unlabeled data in PU learning scenarios.
Findings
2-HNC outperforms existing methods on synthetic datasets.
The method achieves competitive results on real-world datasets.
It provides a reliable ranking of unlabeled samples by negative likelihood.
Abstract
In many scenarios of binary classification, only positive instances are provided in the training data, leaving the rest of the data unlabeled. This setup, known as positive-unlabeled (PU) learning, is addressed here with a network flow-based method which utilizes pairwise similarities between samples. The method we propose here, 2-HNC, leverages Hochbaum's Normalized Cut (HNC) and the set of solutions it provides by solving a parametric minimum cut problem. The set of solutions, that are nested partitions of the samples into two sets, correspond to varying tradeoff values between the two goals: high intra-similarity inside the sets and low inter-similarity between the two sets. This nested sequence is utilized here to deliver a ranking of unlabeled samples by their likelihood of being negative. Building on this insight, our method, 2-HNC, proceeds in two stages. The first stage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training
