Multi-scale Cascaded Foundation Model for Whole-body Organs-at-risk Segmentation
Rui Hao, Dayu Tan, Qiankun Li, Chunhou Zheng, Weimin Zhong, Zhigang Zeng

TL;DR
This paper introduces MCFNet, a multi-scale cascaded fusion network that significantly improves whole-body organs-at-risk segmentation accuracy, robustness, and efficiency, supporting safer radiotherapy and surgical planning.
Contribution
The paper presents a novel multi-scale cascaded fusion architecture with specialized backbones, enhancing boundary localization and fine structure preservation in OAR segmentation.
Findings
Outperforms existing methods in organ segmentation accuracy
Demonstrates strong cross-dataset generalization
Provides reliable, efficient segmentation even on low-resolution inputs
Abstract
Accurate segmentation of organs-at-risk (OARs) is vital for safe and precise radiotherapy and surgery. Most existing studies segment only a limited set of organs or regions, lacking a systematic treatment of OARs segmentation. We present a Multi-scale Cascaded Fusion Network (MCFNet) that aggregates features across multiple scales and resolutions. MCFNet consists of a Sharp Extraction Backbone for the downsampling path and a Flexible Connection Backbone for skip-connection fusion, strengthening representation learning in both stages. This design improves boundary localization and preserves fine structures while maintaining computational efficiency, enabling reliable performance even on low-resolution inputs. Experiments on an NVIDIA A6000 GPU using 36,131 image-mask pairs from 671 patients across 10 datasets show consistent robustness and strong cross-dataset generalization. An adaptive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging
MethodsAdaptive Robust Loss
