Uncertainty in Lung Cancer Stage for Outcome Estimation via Set-Valued Classification
Savannah Bergquist, Gabriel Brooks, Mary Beth Landrum, Nancy Keating,, Sherri Rose

TL;DR
This paper introduces a method that incorporates uncertainty from set-valued classification to improve outcome estimation in lung cancer staging using claims data, enhancing reliability across different cohorts.
Contribution
It develops a novel approach combining set-valued classification and split conformal inference to account for uncertainty in cancer stage prediction across datasets.
Findings
Effective classification of lung cancer stages from claims data.
Rigorous uncertainty quantification in outcome estimation.
Method demonstrated on SEER-Medicare linked data.
Abstract
Difficulty in identifying cancer stage in health care claims data has limited oncology quality of care and health outcomes research. We fit prediction algorithms for classifying lung cancer stage into three classes (stages I/II, stage III, and stage IV) using claims data, and then demonstrate a method for incorporating the classification uncertainty in outcomes estimation. Leveraging set-valued classification and split conformal inference, we show how a fixed algorithm developed in one cohort of data may be deployed in another, while rigorously accounting for uncertainty from the initial classification step. We demonstrate this process using SEER cancer registry data linked with Medicare claims data.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference
