Artificial Entanglement in the Fine-Tuning of Large Language Models
Min Chen, Zihan Wang, Canyu Chen, Zeguan Wu, Manling Li, Junyu Liu

TL;DR
This paper introduces a quantum-information-inspired framework to analyze how low-rank fine-tuning methods like LoRA affect the entanglement structure of large language models, revealing distinct internal and external entanglement patterns.
Contribution
It applies a novel entanglement perspective to understand parameter-efficient fine-tuning, uncovering internal and external entanglement laws and proposing a 'no-hair' property explaining effectiveness.
Findings
Internal entanglement follows a volume law with a suppression 'valley'
External entanglement follows an area law with logarithmic corrections
Differences in entanglement do not affect attention outputs
Abstract
Large language models (LLMs) can be adapted to new tasks using parameter-efficient fine-tuning (PEFT) methods that modify only a small number of trainable parameters, often through low-rank updates. In this work, we adopt a quantum-information-inspired perspective to understand their effectiveness. From this perspective, low-rank parameterizations naturally correspond to low-dimensional Matrix Product States (MPS) representations, which enable entanglement-based characterizations of parameter structure. Thereby, we term and measure "Artificial Entanglement", defined as the entanglement entropy of the parameters in artificial neural networks (in particular the LLMs). We first study the representative low-rank adaptation (LoRA) PEFT method, alongside full fine-tuning (FFT), using LLaMA models at the 1B and 8B scales trained on the Tulu3 and OpenThoughts3 datasets, and uncover: (i)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum many-body systems · Machine Learning in Materials Science · Quantum Computing Algorithms and Architecture
