MUVF-YOLOX: A Multi-modal Ultrasound Video Fusion Network for Renal Tumor Diagnosis
Junyu Li, Han Huang, Dong Ni, Wufeng Xue, Dongmei Zhu, Jun Cheng

TL;DR
This paper introduces MUVF-YOLOX, a multi-modal ultrasound video fusion network that combines B-mode and CEUS-mode videos using attention mechanisms and temporal aggregation to improve renal tumor diagnosis accuracy.
Contribution
The paper presents a novel multi-modal ultrasound video fusion network with an attention-based fusion module and an object-level temporal aggregation module for better renal tumor classification.
Findings
Outperforms single-modal models and existing methods on multicenter datasets.
OTA module improves classification accuracy over frame-level predictions.
Effective multi-modal feature fusion enhances tumor diagnosis.
Abstract
Early diagnosis of renal cancer can greatly improve the survival rate of patients. Contrast-enhanced ultrasound (CEUS) is a cost-effective and non-invasive imaging technique and has become more and more frequently used for renal tumor diagnosis. However, the classification of benign and malignant renal tumors can still be very challenging due to the highly heterogeneous appearance of cancer and imaging artifacts. Our aim is to detect and classify renal tumors by integrating B-mode and CEUS-mode ultrasound videos. To this end, we propose a novel multi-modal ultrasound video fusion network that can effectively perform multi-modal feature fusion and video classification for renal tumor diagnosis. The attention-based multi-modal fusion module uses cross-attention and self-attention to extract modality-invariant features and modality-specific features in parallel. In addition, we design an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Radiomics and Machine Learning in Medical Imaging · MRI in cancer diagnosis
