TL;DR
PathGene is a comprehensive multi-center dataset linking lung cancer histopathology images with genetic mutation data, enabling the development of AI models for early genetic screening and personalized treatment planning.
Contribution
This paper introduces PathGene, a novel large-scale dataset with molecular annotations linked to histopathology images, and benchmarks multiple learning methods for mutation and exon prediction.
Findings
Benchmarking results show varying accuracy among methods
PathGene enables early genetic screening from histopathology images
The dataset facilitates development of precision oncology tools
Abstract
Accurately predicting gene mutations, mutation subtypes and their exons in lung cancer is critical for personalized treatment planning and prognostic assessment. Faced with regional disparities in medical resources and the high cost of genomic assays, using artificial intelligence to infer these mutations and exon variants from routine histopathology images could greatly facilitate precision therapy. Although some prior studies have shown that deep learning can accelerate the prediction of key gene mutations from lung cancer pathology slides, their performance remains suboptimal and has so far been limited mainly to early screening tasks. To address these limitations, we have assembled PathGene, which comprises histopathology images paired with next-generation sequencing reports from 1,576 patients at the Second Xiangya Hospital, Central South University, and 448 TCGA-LUAD patients.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
