TopoLoRA-SAM: Topology-Aware Parameter-Efficient Adaptation of Foundation Segmenters for Thin-Structure and Cross-Domain Binary Semantic Segmentation
Salim Khazem

TL;DR
TopoLoRA-SAM introduces a topology-aware, parameter-efficient method to adapt foundation segmentation models like SAM for thin-structure and cross-domain binary segmentation, achieving superior results with minimal fine-tuning.
Contribution
It proposes a novel topology-aware, parameter-efficient adaptation framework using LoRA and topology supervision for foundation models in binary segmentation tasks.
Findings
Achieves state-of-the-art Dice scores on multiple benchmarks.
Uses only 5.2% of model parameters for training.
Improves robustness on challenging datasets.
Abstract
Foundation segmentation models such as the Segment Anything Model (SAM) exhibit strong zero-shot generalization through large-scale pretraining, but adapting them to domain-specific semantic segmentation remains challenging, particularly for thin structures (e.g., retinal vessels) and noisy modalities (e.g., SAR imagery). Full fine-tuning is computationally expensive and risks catastrophic forgetting. We propose \textbf{TopoLoRA-SAM}, a topology-aware and parameter-efficient adaptation framework for binary semantic segmentation. TopoLoRA-SAM injects Low-Rank Adaptation (LoRA) into the frozen ViT encoder, augmented with a lightweight spatial convolutional adapter and optional topology-aware supervision via differentiable clDice. We evaluate our approach on five benchmarks spanning retinal vessel segmentation (DRIVE, STARE, CHASE\_DB1), polyp segmentation (Kvasir-SEG), and SAR sea/land…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRetinal Imaging and Analysis · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
