What Makes A Good Fisherman? Linear Regression under Self-Selection Bias
Yeshwanth Cherapanamjeri, Constantinos Daskalakis, Andrew Ilyas,, Manolis Zampetakis

TL;DR
This paper develops efficient algorithms for linear regression models under self-selection bias, addressing both known and unknown model selection scenarios, with theoretical guarantees on identifiability and estimation accuracy.
Contribution
It introduces the first computationally and statistically efficient methods for linear models under self-selection, including identifiability results and algorithms for both known and unknown index cases.
Findings
Algorithms with polynomial complexity for known-index case.
Identifiability of models under max self-selection criterion.
Efficient estimation for two models with arbitrary accuracy.
Abstract
In the classical setting of self-selection, the goal is to learn models, simultaneously from observations where is the output of one of underlying models on input . In contrast to mixture models, where we observe the output of a randomly selected model, here the observed model depends on the outputs themselves, and is determined by some known selection criterion. For example, we might observe the highest output, the smallest output, or the median output of the models. In known-index self-selection, the identity of the observed model output is observable; in unknown-index self-selection, it is not. Self-selection has a long history in Econometrics and applications in various theoretical and applied fields, including treatment effect estimation, imitation learning, learning from strategically reported data, and learning from markets at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Advanced Bandit Algorithms Research · Statistical Methods and Inference
