HoloByte: Continuous Hyperspherical Distillation for Tokenizer-Free Modeling
Vladimer Khasia

TL;DR
HoloByte introduces a tokenizer-free, continuous hyperspherical distillation framework for sequence modeling, reducing complexity and improving performance over traditional tokenization methods by operating on continuous representations.
Contribution
It proposes a novel continuous hyperspherical approach for tokenizer-free sequence modeling, with theoretical guarantees and empirical improvements over BPE baselines.
Findings
Reduces attention complexity from O(N^2 D) to a more efficient form.
Outperforms BPE baseline under matched parameters.
Provides a mathematically rigorous foundation for vocabulary-invariant modeling.
Abstract
Sequence modeling universally relies on discrete subword tokenization to circumvent the computational intractability of native byte-level attention. However, this heuristic quantization imposes artificial morphological boundaries, enforces vocabulary dependence, and fractures the continuity of the optimization landscape. To resolve this dichotomy, we introduce \textbf{HoloByte}: a strictly tokenizer-free framework utilizing Continuous Hyperspherical Distillation. HoloByte partitions discrete byte sequences into fixed-capacity chunks and projects them into a continuous, strictly bounded hyperspherical manifold via an invertible, dimension-preserving orthogonal rotation operator. This spatial superposition allows a macroscopic transformer to operate exclusively on compressed continuous representations, formally reducing the exact attention time complexity from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Ferroelectric and Negative Capacitance Devices · Logic, programming, and type systems
