SegCompass: Exploring Interpretable Alignment with Sparse Autoencoders for Enhanced Reasoning Segmentation

Zhenyu Lu; Liupeng Li; Jinpeng Wang; Haoqian Kang; Yan Feng; Ke Chen; Yaowei Wang

arXiv:2605.22658·cs.CV·May 22, 2026

TL;DR

SegCompass introduces an interpretable alignment method using a Sparse Autoencoder for reasoning segmentation, improving transparency and performance in vision-language tasks.

Contribution

SegCompass proposes a novel SAE-based alignment pathway that makes the link between reasoning and segmentation interpretable while matching or surpassing state-of-the-art results on five reasoning segmentation benchmarks.

Findings

1. Matches or surpasses state-of-the-art performance on five benchmarks.
2. Strong correlation between sparse concept quality and segmentation accuracy.
3. Provides a more transparent and coherent reasoning segmentation process.

Abstract

While large language models provide strong compositional reasoning, existing reasoning segmentation pipelines fail to transparently connect this reasoning to visual perception. Current methods, such as latent query alignment, are end-to-end yet opaque "black boxes". Conversely, textual localization readout is merely readable, not truly interpretable, often functioning as an unconstrained post-hoc step. To bridge this interpretability gap, we propose SegCompass, an end-to-end model that leverages a Sparse Autoencoder (SAE) to forge an explicit, interpretable, and differentiable alignment pathway. Given an image-instruction pair, SegCompass first generates a chain-of-thought (CoT) trace. The core of our method is an SAE that maps both the CoT and visual tokens into a shared, high-dimensional sparse concept space. A query codebook selects salient concepts from this space, which are then…
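
The shared sparse concept space described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the class `TinySAE`, the top-k sparsity rule, the max-pooling used to pick salient concepts (a stand-in for the query codebook), and all dimensions are assumptions chosen only to make the idea concrete. The weights are random and untrained, so the numbers carry no meaning.

```python
import random


def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


class TinySAE:
    """Toy sparse autoencoder (hypothetical): token -> sparse concept code -> reconstruction."""

    def __init__(self, d_model, n_concepts, k, seed=0):
        rng = random.Random(seed)
        self.k = k  # assumed top-k sparsity: at most k active concepts per token
        self.W_enc = [[rng.gauss(0, 0.5) for _ in range(d_model)] for _ in range(n_concepts)]
        self.W_dec = [[rng.gauss(0, 0.5) for _ in range(n_concepts)] for _ in range(d_model)]

    def encode(self, x):
        # ReLU activations, then zero out everything except the k largest entries.
        acts = [max(0.0, a) for a in matvec(self.W_enc, x)]
        ranked = sorted(range(len(acts)), key=lambda i: acts[i], reverse=True)
        keep = set(ranked[: self.k])
        return [a if i in keep else 0.0 for i, a in enumerate(acts)]

    def decode(self, z):
        return matvec(self.W_dec, z)


def salient_concepts(token_codes, n_select):
    """Assumed selection rule: max-pool over tokens, keep the strongest active concepts."""
    pooled = [max(column) for column in zip(*token_codes)]
    ranked = sorted(range(len(pooled)), key=lambda i: pooled[i], reverse=True)
    return [i for i in ranked[:n_select] if pooled[i] > 0.0]


def score_visual_tokens(visual_codes, concept_ids):
    """Score each visual token by its total activation on the selected concepts."""
    return [sum(code[i] for i in concept_ids) for code in visual_codes]


# Random stand-ins for CoT token embeddings and visual token embeddings.
rng = random.Random(1)
sae = TinySAE(d_model=8, n_concepts=32, k=4)
cot_tokens = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(5)]
visual_tokens = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(6)]

# Both modalities pass through the same encoder, so their codes share concept indices.
cot_codes = [sae.encode(t) for t in cot_tokens]
visual_codes = [sae.encode(t) for t in visual_tokens]

concepts = salient_concepts(cot_codes, n_select=3)
scores = score_visual_tokens(visual_codes, concepts)
reconstruction = sae.decode(cot_codes[0])
```

Because every token is encoded by the same SAE, a concept index means the same thing for text and for image patches. In this sketch that shared indexing is what allows concepts picked from the reasoning trace to be read off directly against the visual tokens.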

Code & Models

Repositories

ZhenyuLU-Heliodore/SegCompass (GitHub)