LangMamba: A Language-driven Mamba Framework for Low-dose CT Denoising with Vision-language Models

Zhihao Chen; Tao Chen; Chenhui Wang; Qi Gao; Huidong Xie; Chuang Niu; Ge Wang; Hongming Shan

arXiv:2507.06140·eess.IV·July 9, 2025

LangMamba: A Language-driven Mamba Framework for Low-dose CT Denoising with Vision-language Models

Zhihao Chen, Tao Chen, Chenhui Wang, Qi Gao, Huidong Xie, Chuang Niu, Ge Wang, Hongming Shan

PDF

Open Access 1 Repo

TL;DR

LangMamba introduces a novel framework that leverages vision-language models and semantic guidance to significantly improve low-dose CT denoising, enhancing detail preservation, generalizability, and explainability.

Contribution

It presents a two-stage learning strategy combining a language-guided autoencoder and semantic-enhanced denoising with dual-space alignment, pioneering language-driven supervision in LDCT denoising.

Findings

01

Outperforms state-of-the-art denoising methods on public datasets.

02

Exhibits strong generalizability to unseen datasets.

03

Improves explainability through language-guided insights.

Abstract

Low-dose computed tomography (LDCT) reduces radiation exposure but often degrades image quality, potentially compromising diagnostic accuracy. Existing deep learning-based denoising methods focus primarily on pixel-level mappings, overlooking the potential benefits of high-level semantic guidance. Recent advances in vision-language models (VLMs) suggest that language can serve as a powerful tool for capturing structured semantic information, offering new opportunities to improve LDCT reconstruction. In this paper, we introduce LangMamba, a Language-driven Mamba framework for LDCT denoising that leverages VLM-derived representations to enhance supervision from normal-dose CT (NDCT). LangMamba follows a two-stage learning strategy. First, we pre-train a Language-guided AutoEncoder (LangAE) that leverages frozen VLMs to map NDCT images into a semantic space enriched with anatomical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hao1635/langmamba
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMedical Imaging Techniques and Applications · Advanced X-ray and CT Imaging · Digital Radiography and Breast Imaging

MethodsMamba: Linear-Time Sequence Modeling with Selective State Spaces · ALIGN · Focus