Defragmenting Language Models: An Interpretability-based Approach for Vocabulary Expansion

Maitrey Mehta; Nishant Subramani; Zhichao Xu; Ashim Gupta; Vivek Srikumar

arXiv:2604.16656·cs.CL·April 21, 2026

Defragmenting Language Models: An Interpretability-based Approach for Vocabulary Expansion

Maitrey Mehta, Nishant Subramani, Zhichao Xu, Ashim Gupta, Vivek Srikumar

PDF

TL;DR

This paper introduces interpretability-based vocabulary expansion to improve tokenization efficiency in multilingual LLMs, demonstrating significant gains over traditional methods and proposing FragMend to further enhance performance.

Contribution

It advances interpretability-based vocabulary expansion methods, showing their superiority over frequency-based approaches and introducing FragMend for improved tokenization in non-Latin languages.

Findings

01

Interpretability-based methods outperform frequency-based methods in vocabulary expansion.

02

Embedding initialization significantly impacts tokenization efficiency, with ~20 pts gains.

03

FragMend further improves tokenization by leveraging subword detokenization phenomena.

Abstract

All languages are equal; when it comes to tokenization, some are more equal than others. Tokens are the hidden currency that dictate the cost and latency of access to contemporary LLMs. However, many languages written in non-Latin scripts observe a poor exchange rate: LLMs take several multiples of tokens to encode the same information in many languages as they do for English. Our analysis reveals that this issue, known as 'token over-fragmentation', persists in modern open-weight LLMs. The standard remedy is vocabulary expansion that adds target language items missing from the model's vocabulary. In this work, we comprehensively study and advance interpretability-based vocabulary expansion, a new research direction. We focus on two core decisions in the vocabulary expansion process: What items should we add? and How should we initialize their corresponding input and output embeddings?…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.