Pulmonologists-Level lung cancer detection based on standard blood test results and smoking status using an explainable machine learning approach
Ricco Noel Hansen Flyckt, Louise Sjodsholm, Margrethe H{\o}stgaard, Bang Henriksen, Claus Lohman Brasen, Ali Ebrahimi, Ole Hilberg, Torben, Fr{\o}strup Hansen, Uffe Kock Wiil, Lars Henrik Jensen, Abdolrahman Peimankar

TL;DR
This study develops an explainable machine learning model using blood test results and smoking history to detect lung cancer early, outperforming pulmonologists in sensitivity and aiding timely diagnosis.
Contribution
The paper introduces a dynamic ensemble selection ML model for lung cancer detection based on standard blood tests and smoking data, demonstrating superior performance over pulmonologists.
Findings
ML model achieved AUC of 0.77 and sensitivity of 76.2%.
Model outperformed pulmonologists with 9% higher sensitivity.
Key factors included smoking status, age, calcium, neutrophils, and lactate dehydrogenase.
Abstract
Lung cancer (LC) remains the primary cause of cancer-related mortality, largely due to late-stage diagnoses. Effective strategies for early detection are therefore of paramount importance. In recent years, machine learning (ML) has demonstrated considerable potential in healthcare by facilitating the detection of various diseases. In this retrospective development and validation study, we developed an ML model based on dynamic ensemble selection (DES) for LC detection. The model leverages standard blood sample analysis and smoking history data from a large population at risk in Denmark. The study includes all patients examined on suspicion of LC in the Region of Southern Denmark from 2009 to 2018. We validated and compared the predictions by the DES model with diagnoses provided by five pulmonologists. Among the 38,944 patients, 9,940 had complete data of which 2,505 (25\%) had LC. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLung Cancer Diagnosis and Treatment · Radiomics and Machine Learning in Medical Imaging
