ModuSeg: Decoupling Object Discovery and Semantic Retrieval for Training-Free Weakly Supervised Segmentation

Qingze He; Fagui Liu; Dengke Zhang; Qingmao Wei; Quan Tang

arXiv:2604.07021·cs.CV·April 17, 2026

ModuSeg: Decoupling Object Discovery and Semantic Retrieval for Training-Free Weakly Supervised Segmentation

Qingze He, Fagui Liu, Dengke Zhang, Qingmao Wei, Quan Tang

PDF

1 Repo 1 Datasets

TL;DR

ModuSeg introduces a training-free, decoupled framework for weakly supervised segmentation that leverages foundation models and non-parametric feature retrieval to improve boundary accuracy and reduce training complexity.

Contribution

It presents a novel decoupled architecture that separates object discovery from semantic assignment, utilizing foundation models and offline feature banks without parameter fine-tuning.

Findings

01

Achieves competitive performance on benchmark datasets.

02

Better preserves fine object boundaries without fine-tuning.

03

Effectively mitigates boundary ambiguity and quantization errors.

Abstract

Weakly supervised semantic segmentation aims to achieve pixel-level predictions using image-level labels. Existing methods typically entangle semantic recognition and object localization, which often leads models to focus exclusively on sparse discriminative regions. Although foundation models show immense potential, many approaches still follow the tightly coupled optimization paradigm, struggling to effectively alleviate pseudo-label noise and often relying on time-consuming multi-stage retraining or unstable end-to-end joint optimization. To address the above challenges, we present ModuSeg, a training-free weakly supervised semantic segmentation framework centered on explicitly decoupling object discovery and semantic assignment. Specifically, we integrate a general mask proposer to extract geometric proposals with reliable boundaries, while leveraging semantic foundation models to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Autumnair007/ModuSeg
github

Datasets

QZing007/ModuSeg-Pseudo-Masks
dataset· 55 dl
55 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.