Boosting Medical Image-based Cancer Detection via Text-guided Supervision from Reports
Guangyu Guo, Jiawen Yao, Yingda Xia, Tony C. W. Mok, Zhilin Zheng,, Junwei Han, Le Lu, Dingwen Zhang, Jian Zhou, Ling Zhang

TL;DR
This paper introduces a text-guided semi-supervised learning approach that leverages clinical reports and vision-language models to improve cancer detection accuracy in medical imaging while significantly reducing the need for expert annotations.
Contribution
The novel method integrates diagnostic and tumor location prompts into a vision-language model to enhance weakly supervised cancer detection in 3D medical scans.
Findings
Reduces human annotation efforts by at least 70%.
Achieves comparable detection accuracy to fully supervised methods.
Validates effectiveness on a large-scale dataset with 1,651 patients.
Abstract
The absence of adequately sufficient expert-level tumor annotations hinders the effectiveness of supervised learning based opportunistic cancer screening on medical imaging. Clinical reports (that are rich in descriptive textual details) can offer a "free lunch'' supervision information and provide tumor location as a type of weak label to cope with screening tasks, thus saving human labeling workloads, if properly leveraged. However, predicting cancer only using such weak labels can be very changeling since tumors are usually presented in small anatomical regions compared to the whole 3D medical scans. Weakly semi-supervised learning (WSSL) utilizes a limited set of voxel-level tumor annotations and incorporates alongside a substantial number of medical images that have only off-the-shelf clinical reports, which may strike a good balance between minimizing expert annotation workload…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling
MethodsSparse Evolutionary Training
