Beyond Myopia: Learning from Positive and Unlabeled Data through   Holistic Predictive Trends

Xinrui Wang; Wenhai Wan; Chuanxin Geng; Shaoyuan LI and; Songcan Chen

arXiv:2310.04078·cs.LG·October 14, 2024·2 cites

Beyond Myopia: Learning from Positive and Unlabeled Data through Holistic Predictive Trends

Xinrui Wang, Wenhai Wan, Chuanxin Geng, Shaoyuan LI and, Songcan Chen

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel holistic trend-based approach for positive and unlabeled learning, leveraging predictive score patterns over time to improve classification accuracy without extra parameter tuning.

Contribution

It proposes a TPP-inspired trend detection method for PUL, addressing bias and error accumulation issues, and demonstrates superior performance in imbalanced real-world scenarios.

Findings

01

Achieves up to 11.3% improvement in key metrics

02

Effectively models score trends as temporal point processes

03

Does not require additional parameter tuning or prior assumptions

Abstract

Learning binary classifiers from positive and unlabeled data (PUL) is vital in many real-world applications, especially when verifying negative examples is difficult. Despite the impressive empirical performance of recent PUL methods, challenges like accumulated errors and increased estimation bias persist due to the absence of negative labels. In this paper, we unveil an intriguing yet long-overlooked observation in PUL: \textit{resampling the positive data in each training iteration to ensure a balanced distribution between positive and unlabeled examples results in strong early-stage performance. Furthermore, predictive trends for positive and negative classes display distinctly different patterns.} Specifically, the scores (output probability) of unlabeled negative examples consistently decrease, while those of unlabeled positive examples show largely chaotic trends. Instead of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wxr99/holisticpu
pytorchOfficial

Videos

Beyond Myopia: Learning from Positive and Unlabeled Data through Holistic Predictive Trends· slideslive

Taxonomy

TopicsMachine Learning and Data Classification · Data Stream Mining Techniques · Anomaly Detection Techniques and Applications