Pairwise Supervision Can Provably Elicit a Decision Boundary
Han Bao, Takuya Shimada, Liyuan Xu, Issei Sato, Masashi Sugiyama

TL;DR
This paper demonstrates theoretically that pairwise similarity supervision can effectively induce a decision boundary for binary classification, establishing a formal link between similarity learning and classification performance.
Contribution
It provides the first provable theoretical connection between similarity learning and the ability to perform binary classification with a decision boundary.
Findings
Similarity learning relates strongly to binary classification objectives.
An excess risk bound connects similarity learning to classification performance.
Similarity supervision can provably elicit a decision boundary.
Abstract
Similarity learning is a general problem to elicit useful representations by predicting the relationship between a pair of patterns. This problem is related to various important preprocessing tasks such as metric learning, kernel learning, and contrastive learning. A classifier built upon the representations is expected to perform well in downstream classification; however, little theory has been given in literature so far and thereby the relationship between similarity and classification has remained elusive. Therefore, we tackle a fundamental question: can similarity information provably leads a model to perform well in downstream classification? In this paper, we reveal that a product-type formulation of similarity learning is strongly related to an objective of binary classification. We further show that these two different problems are explicitly connected by an excess risk bound.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Text and Document Classification Technologies · Machine Learning and Data Classification
