TM-UNet: Token-Memory Enhanced Sequential Modeling for Efficient Medical Image Segmentation
Yaxuan Jiao, Qing Xu, Yuxiang Luo, Xiangjian He, Zhen Chen, Wenting Duan

TL;DR
TM-UNet introduces a lightweight, memory-augmented transformer framework for efficient medical image segmentation, achieving high accuracy with reduced computational cost.
Contribution
It proposes a novel multi-scale token-memory block that captures long-range dependencies efficiently using a dynamic memory mechanism and hierarchical feature extraction.
Findings
Outperforms state-of-the-art methods on multiple medical segmentation datasets.
Reduces computational cost significantly compared to existing transformer-based models.
Demonstrates effective long-range dependency modeling with linear complexity.
Abstract
Medical image segmentation is essential for clinical diagnosis and treatment planning. Although transformer-based methods have achieved remarkable results, their high computational cost hinders clinical deployment. To address this issue, we propose TM-UNet, a novel lightweight framework that integrates token sequence modeling with an efficient memory mechanism for efficient medical segmentation. Specifically, we introduce a multi-scale token-memory (MSTM) block that transforms 2D spatial features into token sequences through strategic spatial scanning, leveraging matrix memory cells to selectively retain and propagate discriminative contextual information across tokens. This novel token-memory mechanism acts as a dynamic knowledge store that captures long-range dependencies with linear complexity, enabling efficient global reasoning without redundant computation. Our MSTM block further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · AI in cancer detection · Generative Adversarial Networks and Image Synthesis
