Rethinking Class-Prior Estimation for Positive-Unlabeled Learning

Yu Yao; Tongliang Liu; Bo Han; Mingming Gong; Gang Niu and; Masashi Sugiyama; Dacheng Tao

arXiv:2002.03673·cs.LG·June 6, 2022·6 cites

Rethinking Class-Prior Estimation for Positive-Unlabeled Learning

Yu Yao, Tongliang Liu, Bo Han, Mingming Gong, Gang Niu and, Masashi Sugiyama, Dacheng Tao

PDF

Open Access 1 Video

TL;DR

This paper introduces ReCPE, a novel method for positive-unlabeled learning that removes the support containment assumption in class-prior estimation, improving accuracy across various datasets.

Contribution

ReCPE is a new approach that enhances existing CPE methods by removing the support containment assumption, making PU learning more robust.

Findings

01

ReCPE improves all tested state-of-the-art CPE methods.

02

ReCPE reduces positive bias when the support assumption is violated.

03

Empirical results show consistent performance gains across datasets.

Abstract

Given only positive (P) and unlabeled (U) data, PU learning can train a binary classifier without any negative data. It has two building blocks: PU class-prior estimation (CPE) and PU classification; the latter has been well studied while the former has received less attention. Hitherto, the distributional-assumption-free CPE methods rely on a critical assumption that the support of the positive data distribution cannot be contained in the support of the negative data distribution. If this is violated, those CPE methods will systematically overestimate the class prior; it is even worse that we cannot verify the assumption based on the data. In this paper, we rethink CPE for PU learning-can we remove the assumption to make CPE always valid? We show an affirmative answer by proposing Regrouping CPE (ReCPE) that builds an auxiliary probability distribution such that the support of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Rethinking Class-Prior Estimation for Positive-Unlabeled Learning· slideslive

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Imbalanced Data Classification Techniques

MethodsCollaborative Preference Embedding