LoGoSeg: Integrating Local and Global Features for Open-Vocabulary Semantic Segmentation

Junyang Chen; Xiangbo Lv; Zhiqiang Kou; Xingdong Sheng; Ning Xu; Yiguo Qiao

arXiv:2602.05578·cs.CV·February 13, 2026

LoGoSeg: Integrating Local and Global Features for Open-Vocabulary Semantic Segmentation

Junyang Chen, Xiangbo Lv, Zhiqiang Kou, Xingdong Sheng, Ning Xu, Yiguo Qiao

PDF

Open Access 1 Video

TL;DR

LoGoSeg introduces a novel framework for open-vocabulary semantic segmentation that combines local and global features with object priors and region-aware alignment, achieving efficient and accurate pixel-wise segmentation for seen and unseen categories.

Contribution

It presents a single-stage method integrating object priors, region-level alignment, and dual-stream fusion, eliminating the need for external proposals or additional datasets.

Findings

01

Competitive performance on six benchmarks

02

Strong generalization to unseen categories

03

Efficient single-stage framework

Abstract

Open-vocabulary semantic segmentation (OVSS) extends traditional closed-set segmentation by enabling pixel-wise annotation for both seen and unseen categories using arbitrary textual descriptions. While existing methods leverage vision-language models (VLMs) like CLIP, their reliance on image-level pretraining often results in imprecise spatial alignment, leading to mismatched segmentations in ambiguous or cluttered scenes. However, most existing approaches lack strong object priors and region-level constraints, which can lead to object hallucination or missed detections, further degrading performance. To address these challenges, we propose LoGoSeg, an efficient single-stage framework that integrates three key innovations: (i) an object existence prior that dynamically weights relevant categories through global image-text similarity, effectively reducing hallucinations; (ii) a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

LoGoSeg: Integrating Local and Global Features for Open-Vocabulary Semantic Segmentation· underline

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications · Topic Modeling