Risk Bounds for Low Cost Bipartite Ranking
San Gultekin, John Paisley

TL;DR
This paper introduces a low-cost, stochastic algorithm for bipartite ranking that avoids the quadratic sample dependence of traditional methods, backed by novel theoretical bounds and competitive experimental results.
Contribution
It presents a new stochastic algorithm leveraging pairwise squared loss structure with a novel uniform risk bound, reducing sample complexity.
Findings
Significant speed improvements over batch algorithms
Competitive performance on real datasets
Sample complexity does not have quadratic dependence
Abstract
Bipartite ranking is an important supervised learning problem; however, unlike regression or classification, it has a quadratic dependence on the number of samples. To circumvent the prohibitive sample cost, many recent work focus on stochastic gradient-based methods. In this paper we consider an alternative approach, which leverages the structure of the widely-adopted pairwise squared loss, to obtain a stochastic and low cost algorithm that does not require stochastic gradients or learning rates. Using a novel uniform risk bound---based on matrix and vector concentration inequalities---we show that the sample size required for competitive performance against the all-pairs batch algorithm does not have a quadratic dependence. Generalization bounds for both the batch and low cost stochastic algorithms are presented. Experimental results show significant speed gain against the batch…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research · Bayesian Modeling and Causal Inference
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
