LangMamba: A Language-driven Mamba Framework for Low-dose CT Denoising with Vision-language Models
Zhihao Chen, Tao Chen, Chenhui Wang, Qi Gao, Huidong Xie, Chuang Niu, Ge Wang, Hongming Shan

TL;DR
LangMamba introduces a novel framework that leverages vision-language models and semantic guidance to significantly improve low-dose CT denoising, enhancing detail preservation, generalizability, and explainability.
Contribution
It presents a two-stage learning strategy combining a language-guided autoencoder and semantic-enhanced denoising with dual-space alignment, pioneering language-driven supervision in LDCT denoising.
Findings
Outperforms state-of-the-art denoising methods on public datasets.
Exhibits strong generalizability to unseen datasets.
Improves explainability through language-guided insights.
Abstract
Low-dose computed tomography (LDCT) reduces radiation exposure but often degrades image quality, potentially compromising diagnostic accuracy. Existing deep learning-based denoising methods focus primarily on pixel-level mappings, overlooking the potential benefits of high-level semantic guidance. Recent advances in vision-language models (VLMs) suggest that language can serve as a powerful tool for capturing structured semantic information, offering new opportunities to improve LDCT reconstruction. In this paper, we introduce LangMamba, a Language-driven Mamba framework for LDCT denoising that leverages VLM-derived representations to enhance supervision from normal-dose CT (NDCT). LangMamba follows a two-stage learning strategy. First, we pre-train a Language-guided AutoEncoder (LangAE) that leverages frozen VLMs to map NDCT images into a semantic space enriched with anatomical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging Techniques and Applications · Advanced X-ray and CT Imaging · Digital Radiography and Breast Imaging
MethodsMamba: Linear-Time Sequence Modeling with Selective State Spaces · ALIGN · Focus
