GlitchMiner: Mining Glitch Tokens in Large Language Models via Gradient-based Discrete Optimization

Zihui Wu; Haichang Gao; Ping Wang; Shudong Zhang; Zhaoxiang Liu; Shiguo Lian

arXiv:2410.15052 · cs.AI · November 11, 2025

TL;DR

GlitchMiner is a gradient-guided framework that detects glitch tokens in large language models by searching for tokens that maximize predictive entropy. The authors report that it outperforms existing methods in detection accuracy and query efficiency.

Contribution

The paper introduces a behavior-driven, gradient-based optimization approach to glitch token detection that does not depend on model-specific heuristics and is intended to scale across models.

Findings

01. Outperforms existing detection methods in accuracy

02. Demonstrates high query efficiency across multiple LLMs

03. Provides a generalizable approach for glitch token discovery

Abstract

Glitch tokens, inputs that trigger unpredictable or anomalous behavior in Large Language Models (LLMs), pose significant challenges to model reliability and safety. Existing detection methods primarily rely on heuristic embedding patterns or statistical anomalies within internal representations, limiting their generalizability across different model architectures and potentially missing anomalies that deviate from observed patterns. We introduce GlitchMiner, a behavior-driven framework designed to identify glitch tokens by maximizing predictive entropy. Leveraging a gradient-guided local search strategy, GlitchMiner efficiently explores the discrete token space without relying on model-specific heuristics or large-batch sampling. Extensive experiments across ten LLMs from five major model families demonstrate that GlitchMiner consistently outperforms existing approaches in detection…
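
To make the two ideas in the abstract concrete (predictive entropy as the objective, and a gradient-guided local search over discrete tokens), here is a minimal sketch in plain Python. It is not the authors' implementation: the linear "model" (`toy_logits`), the tiny embedding table, the first-order (Taylor) ranking of candidates, and the function names are all assumptions made for illustration. The sketch uses the gradient of the entropy at the current token's embedding to shortlist the candidates estimated to have the highest entropy, then verifies only that shortlist with real forward passes.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predictive_entropy(logits):
    """Shannon entropy (in nats) of the next-token distribution."""
    return -sum(p * math.log(p) for p in softmax(logits) if p > 0.0)


def toy_logits(embedding, weights):
    """Hypothetical stand-in for an LLM forward pass: logits = W @ e."""
    return [sum(w * x for w, x in zip(row, embedding)) for row in weights]


def entropy_grad_wrt_embedding(embedding, weights):
    """Gradient of the entropy w.r.t. the input embedding of the toy model.

    For softmax outputs p and entropy H, dH/dz_j = -p_j * (log p_j + H);
    the chain rule through logits = W @ e then gives dH/de = W^T dH/dz.
    """
    logits = toy_logits(embedding, weights)
    probs = softmax(logits)
    h = predictive_entropy(logits)
    g_logits = [-p * (math.log(p) + h) if p > 0.0 else 0.0 for p in probs]
    return [
        sum(g_logits[j] * weights[j][d] for j in range(len(weights)))
        for d in range(len(embedding))
    ]


def local_search_step(current, embeddings, weights, k=2):
    """One illustrative search step (an assumed procedure, not the paper's).

    1. Estimate each candidate's entropy with a first-order approximation
       around the current token's embedding.
    2. Keep the k candidates with the highest estimate.
    3. Evaluate their true entropy and return the best one.
    """
    e_cur = embeddings[current]
    h_cur = predictive_entropy(toy_logits(e_cur, weights))
    grad = entropy_grad_wrt_embedding(e_cur, weights)

    estimates = {}
    for token, emb in embeddings.items():
        if token == current:
            continue
        delta = [a - b for a, b in zip(emb, e_cur)]
        estimates[token] = h_cur + sum(g * d for g, d in zip(grad, delta))

    shortlist = sorted(estimates, key=estimates.get, reverse=True)[:k]
    true_entropy = {
        t: predictive_entropy(toy_logits(embeddings[t], weights)) for t in shortlist
    }
    best = max(true_entropy, key=true_entropy.get)
    return best, true_entropy[best]


if __name__ == "__main__":
    # Made-up 3-token vocabulary with 2-dimensional embeddings.
    weights = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
    embeddings = {"a": [3.0, 0.0], "b": [1.0, 0.0], "c": [0.0, 0.0]}
    print(local_search_step("a", embeddings, weights, k=1))
```

In this toy setup only the shortlisted candidates cost a forward pass, which is the intuition behind query efficiency; a real detector would additionally need a prompt template and a check that a high-entropy token actually misbehaves, neither of which is shown here.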

Code & Models

Repositories

wooozihui/GlitchMiner (PyTorch, official)

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Data Mining Algorithms and Applications