Beyond Myopia: Learning from Positive and Unlabeled Data through Holistic Predictive Trends
Xinrui Wang, Wenhai Wan, Chuanxin Geng, Shaoyuan LI and, Songcan Chen

TL;DR
This paper introduces a novel holistic trend-based approach for positive and unlabeled learning, leveraging predictive score patterns over time to improve classification accuracy without extra parameter tuning.
Contribution
It proposes a TPP-inspired trend detection method for PUL, addressing bias and error accumulation issues, and demonstrates superior performance in imbalanced real-world scenarios.
Findings
Achieves up to 11.3% improvement in key metrics
Effectively models score trends as temporal point processes
Does not require additional parameter tuning or prior assumptions
Abstract
Learning binary classifiers from positive and unlabeled data (PUL) is vital in many real-world applications, especially when verifying negative examples is difficult. Despite the impressive empirical performance of recent PUL methods, challenges like accumulated errors and increased estimation bias persist due to the absence of negative labels. In this paper, we unveil an intriguing yet long-overlooked observation in PUL: \textit{resampling the positive data in each training iteration to ensure a balanced distribution between positive and unlabeled examples results in strong early-stage performance. Furthermore, predictive trends for positive and negative classes display distinctly different patterns.} Specifically, the scores (output probability) of unlabeled negative examples consistently decrease, while those of unlabeled positive examples show largely chaotic trends. Instead of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Data Stream Mining Techniques · Anomaly Detection Techniques and Applications
