A Semiparametric Efficient Approach To Label Shift Estimation and Quantification
Brandon Tse Wei Chow

TL;DR
This paper introduces SELSE, a semiparametric efficient method for estimating label shift in transfer learning, demonstrating superior performance over existing algorithms especially with large test datasets.
Contribution
The paper proposes SELSE, a novel semiparametric efficient estimator for label shift, and proves its optimality among a broad class of quantification algorithms.
Findings
SELSE outperforms existing methods in empirical tests.
SELSE is especially effective with large test sample sizes.
Theoretical proof of SELSE's semiparametric efficiency.
Abstract
Transfer Learning is an area of statistics and machine learning research that seeks answers to the following question: how do we build successful learning algorithms when the data available for training our model is qualitatively different from the data we hope the model will perform well on? In this thesis, we focus on a specific area of Transfer Learning called label shift, also known as quantification. In quantification, the aforementioned discrepancy is isolated to a shift in the distribution of the response variable. In such a setting, accurately inferring the response variable's new distribution is both an important estimation task in its own right and a crucial step for ensuring that the learning algorithm can adapt to the new data. We make two contributions to this field. First, we present a new procedure called SELSE which estimates the shift in the response variable's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Neural Networks and Applications
MethodsTest
