TL;DR
SegCompass introduces an interpretable alignment method using a Sparse Autoencoder for reasoning segmentation, improving transparency and performance in vision-language tasks.
Contribution
It proposes a novel SAE-based alignment pathway that enhances interpretability and achieves state-of-the-art results in reasoning segmentation benchmarks.
Findings
Matches or surpasses state-of-the-art performance on five benchmarks.
Strong correlation between sparse concept quality and segmentation accuracy.
Provides a more transparent and coherent reasoning segmentation process.
Abstract
While large language models provide strong compositional reasoning, existing reasoning segmentation pipelines fail to transparently connect this reasoning to visual perception. Current methods, such as latent query alignment, are end-to-end yet opaque "black boxes". Conversely, textual localization readout is merely readable, not truly interpretable, often functioning as an unconstrained post-hoc step. To bridge this interpretability gap, we propose SegCompass, an end-to-end model that leverages a Sparse Autoencoder (SAE) to forge an explicit, interpretable, and differentiable alignment pathway. Given an image-instruction pair, SegCompass first generates a chain-of-thought (CoT) trace. The core of our method is an SAE that maps both the CoT and visual tokens into a shared, high-dimensional sparse concept space. A query codebook selects salient concepts from this space, which are then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
