TL;DR
USCNet is a novel Transformer-based multimodal framework that enables accurate preoperative classification of kidney stones by integrating CT images and EHR data, surpassing existing methods.
Contribution
The paper introduces USCNet, a new multimodal fusion model with segmentation guidance and a dynamic loss function for improved kidney stone classification.
Findings
USCNet achieves superior performance on in-house dataset.
The model significantly outperforms existing classification methods.
Source code is publicly available at the provided GitHub link.
Abstract
Kidney stone disease ranks among the most prevalent conditions in urology, and understanding the composition of these stones is essential for creating personalized treatment plans and preventing recurrence. Current methods for analyzing kidney stones depend on postoperative specimens, which prevents rapid classification before surgery. To overcome this limitation, we introduce a new approach called the Urinary Stone Segmentation and Classification Network (USCNet). This innovative method allows for precise preoperative classification of kidney stones by integrating Computed Tomography (CT) images with clinical data from Electronic Health Records (EHR). USCNet employs a Transformer-based multimodal fusion framework with CT-EHR attention and segmentation-guided attention modules for accurate classification. Moreover, a dynamic loss function is introduced to effectively balance the dual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
