LayerTracer: A Joint Task-Particle and Vulnerable-Layer Analysis framework for Arbitrary Large Language Model Architectures
Yuhang Wu, Qinyuan Liu, Qiuyang Zhao, and Qingwei Chong

TL;DR
LayerTracer is a versatile analysis framework for LLMs that identifies task execution points and vulnerability layers, aiding in architecture design and interpretability across various models.
Contribution
It introduces a novel, architecture-agnostic method to analyze hierarchical representations and robustness in diverse LLM architectures.
Findings
Task particles mainly occur in deep layers across models.
Larger models show increased hierarchical robustness.
LayerTracer accurately locates task-effective and vulnerable layers.
Abstract
Currently, Large Language Models (LLMs) feature a diversified architectural landscape, including traditional Transformer, GateDeltaNet, and Mamba. However, the evolutionary laws of hierarchical representations, task knowledge formation positions, and network robustness bottleneck mechanisms in various LLM architectures remain unclear, posing core challenges for hybrid architecture design and model optimization. This paper proposes LayerTracer, an architecture-agnostic end-to-end analysis framework compatible with any LLM architecture. By extracting hidden states layer-by-layer and mapping them to vocabulary probability distributions, it achieves joint analysis of task particle localization and layer vulnerability quantification. We define the task particle as the key layer where the target token probability first rises significantly, representing the model's task execution starting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
