TAMISeg: Text-Aligned Multi-scale Medical Image Segmentation with Semantic Encoder Distillation

Qiang Gao; Yi Wang; Yong Zhang; Yong Li; Yongbing Deng; Lan Du; Cunjian Chen

arXiv:2604.10912·cs.CV·April 14, 2026

TAMISeg: Text-Aligned Multi-scale Medical Image Segmentation with Semantic Encoder Distillation

Qiang Gao, Yi Wang, Yong Zhang, Yong Li, Yongbing Deng, Lan Du, Cunjian Chen

PDF

1 Repo

TL;DR

TAMISeg is a novel text-guided medical image segmentation framework that leverages semantic distillation and multi-scale decoding to improve segmentation accuracy with limited annotations.

Contribution

It introduces a multi-component framework combining a robust encoder, semantic distillation, and scale-adaptive decoding for enhanced medical image segmentation.

Findings

01

Outperforms existing methods on multiple datasets.

02

Demonstrates robustness to image noise and low contrast.

03

Achieves superior segmentation accuracy with limited annotations.

Abstract

Medical image segmentation remains challenging due to limited fine-grained annotations, complex anatomical structures, and image degradation from noise, low contrast, or illumination variation. We propose TAMISeg, a text-guided segmentation framework that incorporates clinical language prompts and semantic distillation as auxiliary semantic cues to enhance visual understanding and reduce reliance on pixel-level fine-grained annotations. TAMISeg integrates three core components: a consistency-aware encoder pretrained with strong perturbations for robust feature extraction, a semantic encoder distillation module with supervision from a frozen DINOv3 teacher to enhance semantic discriminability, and a scale-adaptive decoder that segments anatomical structures across different spatial scales. Experiments on the Kvasir-SEG, MosMedData+, and QaTa-COV19 datasets demonstrate that TAMISeg…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qczggaoqiang/TAMISeg
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.