Agglomerative Token Clustering
Joakim Bruslund Haurum, Sergio Escalera, Graham W. Taylor, Thomas B. Moeslund

TL;DR
Agglomerative Token Clustering (ATC) is a hierarchical token merging method that improves performance across several vision tasks without additional learnable parameters. It is especially effective at low keep rates, where only a small fraction of tokens is retained.
Contribution
ATC introduces a hierarchical clustering-based token merging technique that outperforms existing methods across multiple vision tasks without requiring extra parameters.
Findings
ATC achieves state-of-the-art results in image classification, image synthesis, and object detection & segmentation.
ATC performs on par with the prior state of the art even when applied off-the-shelf, i.e. without fine-tuning.
ATC is highly effective at low token retention rates.
Abstract
We present Agglomerative Token Clustering (ATC), a novel token merging method that consistently outperforms previous token merging and pruning methods across image classification, image synthesis, and object detection & segmentation tasks. ATC merges clusters through bottom-up hierarchical clustering, without the introduction of extra learnable parameters. We find that ATC achieves state-of-the-art performance across all tasks, and can even perform on par with prior state-of-the-art when applied off-the-shelf, i.e. without fine-tuning. ATC is particularly effective when applied with low keep rates, where only a small fraction of tokens are kept and retaining task performance is especially difficult.
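The bottom-up merging described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the use of plain Python lists as token features, the cosine distance, and the mean-averaging of merged tokens are assumptions made for illustration. The three linkage options mirror the "single", "complete", and "average" variants named in the model list.

```python
import math


def cosine_distance(a, b):
    # Assumption: tokens are non-zero vectors, so the norms are positive.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def cluster_distance(c1, c2, dist, linkage):
    # Distance between two clusters, given pairwise token distances.
    pairs = [dist[i][j] for i in c1 for j in c2]
    if linkage == "single":
        return min(pairs)
    if linkage == "complete":
        return max(pairs)
    if linkage == "average":
        return sum(pairs) / len(pairs)
    raise ValueError(f"unknown linkage: {linkage}")


def agglomerative_merge(tokens, keep_rate, linkage="average"):
    """Hypothetical helper: merge tokens bottom-up until
    ceil(keep_rate * len(tokens)) clusters remain."""
    n = len(tokens)
    n_keep = max(1, math.ceil(keep_rate * n))
    dist = [[cosine_distance(a, b) for b in tokens] for a in tokens]

    # Start with every token in its own cluster.
    clusters = [[i] for i in range(n)]
    while len(clusters) > n_keep:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = cluster_distance(clusters[a], clusters[b], dist, linkage)
                if best is None or d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        # Merge the closest pair of clusters.
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]

    # Assumption: each cluster is represented by the mean of its tokens.
    dim = len(tokens[0])
    merged = [
        [sum(tokens[i][k] for i in c) / len(c) for k in range(dim)]
        for c in clusters
    ]
    return merged, clusters


tokens = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
merged, clusters = agglomerative_merge(tokens, keep_rate=0.5)
print(clusters)
```

In this toy input, the two tokens near each axis are grouped together, so a keep rate of 0.5 reduces four tokens to two. The sketch recomputes cluster distances naively at every step, which is fine for illustration but too slow for real token counts.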
Code & Models
- 🤗 joakimbh/ATC (model)
- 🤗 joakimbh/atc_small-coco17-average (model)
- 🤗 joakimbh/atc_small-coco17-complete (model)
- 🤗 joakimbh/atc_small-coco17-single (model)
- 🤗 joakimbh/tome_small-coco17 (model)
- 🤗 joakimbh/tome_tiny-coco17 (model)
- 🤗 joakimbh/atc_tiny-coco17-average (model)
- 🤗 joakimbh/atc_tiny-coco17-complete (model)
- 🤗 joakimbh/atc_tiny-coco17-single (model)
- 🤗 joakimbh/atc_base-im1k-average (model)
Taxonomy
Topics: Spam and Phishing Detection · Network Security and Intrusion Detection · Authorship Attribution and Profiling
Methods: Pruning
