SAM-FNet: SAM-Guided Fusion Network for Laryngo-Pharyngeal Tumor Detection
Jia Wei, Yun Li, Meiyu Qiu, Hongyu Chen, Xiaomao Fan, Wenbin Lei

TL;DR
This paper introduces SAM-FNet, a dual-branch network leveraging SAM for precise lesion segmentation and a GAN-like module for discriminative feature fusion, significantly improving laryngo-pharyngeal tumor detection accuracy.
Contribution
The novel integration of SAM for lesion segmentation and a GAN-like feature optimization module in a dual-branch network for enhanced tumor detection.
Findings
Outperforms state-of-the-art methods on two LPC datasets.
Achieves high accuracy in lesion localization.
Demonstrates robustness across different datasets.
Abstract
Laryngo-pharyngeal cancer (LPC) is a highly fatal malignant disease affecting the head and neck region. Previous studies on endoscopic tumor detection, particularly those leveraging dual-branch network architectures, have shown significant advancements in tumor detection. These studies highlight the potential of dual-branch networks in improving diagnostic accuracy by effectively integrating global and local (lesion) feature extraction. However, they are still limited in their capabilities to accurately locate the lesion region and capture the discriminative feature information between the global and local branches. To address these issues, we propose a novel SAM-guided fusion network (SAM-FNet), a dual-branch network for laryngo-pharyngeal tumor detection. By leveraging the powerful object segmentation capabilities of the Segment Anything Model (SAM), we introduce the SAM into the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHead and Neck Cancer Studies
MethodsSegment Anything Model
