In-Context Positive-Unlabeled Learning
Siyan Liu, Yi Chang, Manli Cheng, Qinglong Tian, Pengfei Li

TL;DR
This paper introduces PUICL, a pretrained transformer that performs positive-unlabeled learning through in-context learning, eliminating the need for dataset-specific training or tuning.
Contribution
PUICL is the first transformer-based model to solve PU classification via in-context learning, trained on synthetic datasets to handle various feature-label relationships.
Findings
PUICL outperforms four standard PU baselines in AUC and accuracy.
PUICL is competitive on F1-score across 20 semi-synthetic benchmarks.
The approach demonstrates in-context learning extends to semi-supervised tabular classification.
Abstract
Positive-unlabeled (PU) learning addresses binary classification when only a set of labeled positives is available alongside a pool of unlabeled samples drawn from a mixture of positives and negatives. Existing PU methods typically require dataset-specific training or iterative optimization, which limits their applicability when many tasks must be solved quickly or with little tuning. We introduce PUICL, a pretrained transformer that solves PU classification entirely through in-context learning. PUICL is pretrained on synthetic PU datasets generated from randomly instantiated structural causal models, exposing it to a wide range of feature-label relationships and class-prior configurations. At inference time, PUICL receives the labeled positives and the unlabeled samples as a single input and returns class probabilities for the unlabeled rows in one forward pass, with no gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
