Improved Baselines with Synchronized Encoding for Universal Medical Image Segmentation
Sihan Yang, Jiadong Feng, Xuande Mi, Haixia Bi, Hai Zhang, Jian Sun

TL;DR
SyncSAM introduces a synchronized dual-branch encoder and multi-scale decoder to improve medical image segmentation, achieving state-of-the-art results and strong zero-shot performance across diverse datasets.
Contribution
The paper proposes SyncSAM, a novel architecture combining convolution and Transformer features with synchronized encoding for enhanced medical image segmentation.
Findings
Achieves state-of-the-art segmentation performance on test datasets.
Exhibits strong zero-shot generalization to unseen datasets.
Provides a new strong baseline for future medical segmentation research.
Abstract
Large foundation models, known for their strong zero-shot generalization capabilities, can be applied to a wide range of downstream tasks. However, developing foundation models for medical image segmentation poses a significant challenge due to the domain gap between natural and medical images. While fine-tuning techniques based on the Segment Anything Model (SAM) have been explored, they primarily focus on scaling up data or refining inference strategies without incorporating domain-specific architectural designs, limiting their zero-shot performance. To optimize segmentation performance under standard inference settings and provide a strong baseline for future research, we introduce SyncSAM, which employs a synchronized dual-branch encoder that integrates convolution and Transformer features in a synchronized manner to enhance medical image encoding, and a multi-scale dual-branch…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Medical Imaging Techniques and Applications · AI in cancer detection
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Linear Layer · Max Pooling · Residual Connection · Layer Normalization · Multi-Head Attention · Position-Wise Feed-Forward Layer · Adam · Attention Is All You Need · Byte Pair Encoding
