Distance-based Positive and Unlabeled Learning for Ranking
Hayden S. Helm, Amitabh Basu, Avanti Athreya, Youngser Park, Joshua T., Vogelstein, Carey E. Priebe, Michael Winding, Marta Zlatic, Albert Cardona,, Patrick Bourke, Jonathan Larson, Marah Abdin, Piali Choudhury, Weiwei Yang,, Christopher W. White

TL;DR
This paper introduces a novel ranking method that leverages positive and unlabeled data using distance metrics and integer linear programming, effective even with minimal supervision, demonstrated in graph vertex nomination.
Contribution
It proposes a model-agnostic distance-based ranking approach utilizing ILP with limited supervision, applicable to various domains including graph vertex nomination.
Findings
Effective ranking with minimal supervision demonstrated
Method outperforms traditional approaches in simulations
Applicable to real-world graph data
Abstract
Learning to rank -- producing a ranked list of items specific to a query and with respect to a set of supervisory items -- is a problem of general interest. The setting we consider is one in which no analytic description of what constitutes a good ranking is available. Instead, we have a collection of representations and supervisory information consisting of a (target item, interesting items set) pair. We demonstrate analytically, in simulation, and in real data examples that learning to rank via combining representations using an integer linear program is effective when the supervision is as light as "these few items are similar to your item of interest." While this nomination task is quite general, for specificity we present our methodology from the perspective of vertex nomination in graphs. The methodology described herein is model agnostic.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Data Management and Algorithms · Game Theory and Voting Systems
