EMA-SAM: Exponential Moving-average for SAM-based PTMC Segmentation
Maryam Dialameh, Hossein Rajabzadeh, Jung Suk Sim, and Hyock Ju Kwon

TL;DR
EMA-SAM enhances the stability and accuracy of tumor segmentation in ultrasound videos by integrating an exponential moving average mechanism into SAM-2, enabling real-time, coherent tracking during interventions.
Contribution
The paper introduces EMA-SAM, a novel extension of SAM-2 that incorporates a confidence-weighted exponential moving average to improve temporal stability and accuracy in ultrasound tumor segmentation.
Findings
MaxDice improved from 0.82 to 0.86 on PTMC dataset.
False positives reduced by 29%.
Achieves 30 FPS with minimal computational overhead.
Abstract
Papillary thyroid microcarcinoma (PTMC) is increasingly managed with radio-frequency ablation (RFA), yet accurate lesion segmentation in ultrasound videos remains difficult due to low contrast, probe-induced motion, and heat-related artifacts. The recent Segment Anything Model 2 (SAM-2) generalizes well to static images, but its frame-independent design yields unstable predictions and temporal drift in interventional ultrasound. We introduce \textbf{EMA-SAM}, a lightweight extension of SAM-2 that incorporates a confidence-weighted exponential moving average pointer into the memory bank, providing a stable latent prototype of the tumour across frames. This design preserves temporal coherence through probe pressure and bubble occlusion while rapidly adapting once clear evidence reappears. On our curated PTMC-RFA dataset (124 minutes, 13 patients), EMA-SAM improves \emph{maxDice} from 0.82…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Advanced Neural Network Applications · Ultrasound Imaging and Elastography
