Single-pass Adaptive Image Tokenization for Minimum Program Search
Shivam Duggal, Sanghyun Byun, William T. Freeman, Antonio Torralba, Phillip Isola

TL;DR
This paper introduces KARL, a single-pass adaptive image tokenizer inspired by Kolmogorov Complexity, which predicts the optimal number of tokens for an image to efficiently balance compression and reconstruction quality.
Contribution
KARL is the first single-pass adaptive tokenizer that predicts token count based on image complexity, eliminating the need for test-time search and aligning with principles of Algorithmic Information Theory.
Findings
KARL matches the performance of multi-pass adaptive tokenizers.
Scaling laws reveal the impact of model size and tokenization methods.
Analysis shows alignment between predicted complexity and human intuition.
Abstract
According to Algorithmic Information Theory (AIT) -- Intelligent representations compress data into the shortest possible program that can reconstruct its content, exhibiting low Kolmogorov Complexity (KC). In contrast, most visual representation learning systems use fixed-length representations for all inputs, ignoring variations in complexity or familiarity. Recent adaptive tokenization methods address this by allocating variable-length representations but typically require test-time search over multiple encodings to find the most predictive one. Inspired by Kolmogorov Complexity principles, we propose a single-pass adaptive tokenizer, KARL, which predicts the appropriate number of tokens for an image in a single forward pass, halting once its approximate KC is reached. The token count serves as a proxy for the minimum description length. KARL's training procedure closely resembles…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputability, Logic, AI Algorithms · Adversarial Robustness in Machine Learning · Ferroelectric and Negative Capacitance Devices
