Regression Analysis with Response-biased Sampling
Kani Chen, Yuanyuan Lin, Yuan Yao, Chaoxu Zhou

TL;DR
This paper introduces a robust regression method for response-biased sampling using maximum rank correlation estimation, applicable to various models without needing sampling probabilities or error distribution assumptions.
Contribution
It develops a valid, consistent, and asymptotically normal estimation approach for response-biased sampling that is easy to implement and does not require complex density estimation.
Findings
Method performs well in numerical studies.
Applicable to multiple transformation models.
Demonstrated with real-world data applications.
Abstract
Response-biased sampling, in which samples are drawn from a popula- tion according to the values of the response variable, is common in biomedical, epidemiological, economic and social studies. In particular, the complete obser- vations in data with censoring, truncation or missing covariates can be regarded as response-biased sampling under certain conditions. This paper proposes to use transformation models, known as the generalized accelerated failure time model in econometrics, for regression analysis with response-biased sampling. With unknown error distribution, the transformation models are broad enough to cover linear re- gression models, the Cox's model and the proportional odds model as special cases. To the best of our knowledge, except for the case-control logistic regression, there is no report in the literature that a prospective estimation approach can work for biased…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Bayesian Methods and Mixture Models
