BYOCL: Build Your Own Consistent Latent with Hierarchical Representative   Latent Clustering

Jiayue Dai; Yunya Wang; Yihan Fang; Yuetong Chen; Butian Xiong

arXiv:2410.15060·cs.CV·April 11, 2025

BYOCL: Build Your Own Consistent Latent with Hierarchical Representative Latent Clustering

Jiayue Dai, Yunya Wang, Yihan Fang, Yuetong Chen, Butian Xiong

PDF

Open Access 1 Repo

TL;DR

BYOCL introduces a hierarchical clustering-based segmentation method that enhances consistency across image sequences, significantly reducing computational costs and outperforming existing models without additional training.

Contribution

It presents a novel, training-free, plug-and-play hierarchical clustering approach for consistent segmentation across image sequences using foundation models.

Findings

01

Outperforms SAM in extensive experiments

02

Reduces time and space complexity exponentially

03

Achieves state-of-the-art segmentation results

Abstract

To address the semantic inconsistency issue with SAM or other single-image segmentation models handling image sequences, we introduce BYOCL. This novel model outperforms SAM in extensive experiments, showcasing its Hierarchical prototype capabilities across CLIP and other representations. BYOCL significantly reduces time and space consumption by dividing inputs into smaller batches, achieving exponential time reduction compared to previous methods. Our approach leverages the SAM image encoder for feature extraction, followed by Intra-Batch and Inter-Batch clustering algorithms. Extensive experiments demonstrate that BYOCL far exceeds the previous state-of-the-art single image segmentation model. Our work is the first to apply consistent segmentation using foundation models without requiring training, utilizing plug-and-play modules for any latent space, making our method highly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cyt1202/byocl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Topic Modeling

MethodsContrastive Language-Image Pre-training · Segment Anything Model