Single-pass Adaptive Image Tokenization for Minimum Program Search

Shivam Duggal; Sanghyun Byun; William T. Freeman; Antonio Torralba; Phillip Isola

arXiv:2507.07995·cs.CV·July 11, 2025

Single-pass Adaptive Image Tokenization for Minimum Program Search

Shivam Duggal, Sanghyun Byun, William T. Freeman, Antonio Torralba, Phillip Isola

PDF

Open Access

TL;DR

This paper introduces KARL, a single-pass adaptive image tokenizer inspired by Kolmogorov Complexity, which predicts the optimal number of tokens for an image to efficiently balance compression and reconstruction quality.

Contribution

KARL is the first single-pass adaptive tokenizer that predicts token count based on image complexity, eliminating the need for test-time search and aligning with principles of Algorithmic Information Theory.

Findings

01

KARL matches the performance of multi-pass adaptive tokenizers.

02

Scaling laws reveal the impact of model size and tokenization methods.

03

Analysis shows alignment between predicted complexity and human intuition.

Abstract

According to Algorithmic Information Theory (AIT) -- Intelligent representations compress data into the shortest possible program that can reconstruct its content, exhibiting low Kolmogorov Complexity (KC). In contrast, most visual representation learning systems use fixed-length representations for all inputs, ignoring variations in complexity or familiarity. Recent adaptive tokenization methods address this by allocating variable-length representations but typically require test-time search over multiple encodings to find the most predictive one. Inspired by Kolmogorov Complexity principles, we propose a single-pass adaptive tokenizer, KARL, which predicts the appropriate number of tokens for an image in a single forward pass, halting once its approximate KC is reached. The token count serves as a proxy for the minimum description length. KARL's training procedure closely resembles…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputability, Logic, AI Algorithms · Adversarial Robustness in Machine Learning · Ferroelectric and Negative Capacitance Devices