Mutagenesis screen to map the functions of parameters of Large Language Models
Yue Hu, Gang Hu, Jixin Zheng, Patrick X. Zhao, Ruimeng Wang

TL;DR
This study applies a biological mutagenesis-inspired approach to systematically explore how parameters in large language models influence their functionalities, revealing complex structures and sensitivities within the models.
Contribution
Introduces a mutagenesis screening method for LLMs to analyze parameter-function relationships, uncovering fine structures and mutation sensitivities not previously documented.
Findings
Mutagenesis reveals diverse sensitivities across model matrices.
Mutations cluster along axes, indicating structured parameter-function relationships.
Certain mutations lead to specific output styles, such as poetic or conversational.
Abstract
Large Language Models (LLMs) have significantly advanced artificial intelligence, excelling in numerous tasks. Although the functionality of a model is inherently tied to its parameters, a systematic method for exploring the connections between the parameters and the functionality is lacking. Models sharing similar structure and parameter counts exhibit significant performance disparities across various tasks, prompting investigations into the varying patterns that govern their performance. We adopted a mutagenesis screen approach, inspired by the methods used in biological studies, to investigate Llama2-7b and Zephyr. This technique involved mutating elements within the models' matrices to their maximum or minimum values to examine the relationship between model parameters and their functionalities. Our research uncovered multiple levels of fine structures within both models. Many…
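The mutation step described in the abstract can be illustrated with a toy sketch. This is not the authors' code: the tiny weight matrix, the linear `apply` function, the probe vector, and the absolute-difference effect score are all assumptions made for illustration, and "maximum or minimum values" is read here as the largest or smallest entry of the same matrix, which the abstract does not spell out. The real study works on the matrices of Llama2-7b and Zephyr and judges generated text, which this sketch does not attempt.

```python
def mutate_element(matrix, row, col, mode):
    """Return a copy of `matrix` with one entry set to the matrix-wide max or min.

    Assumption: the mutation target is the extreme value of the same matrix.
    """
    if mode not in ("max", "min"):
        raise ValueError("mode must be 'max' or 'min'")
    flat = [value for r in matrix for value in r]
    target = max(flat) if mode == "max" else min(flat)
    mutant = [list(r) for r in matrix]  # copy so the original stays intact
    mutant[row][col] = target
    return mutant


def apply(matrix, vector):
    """Toy stand-in for a model: a single matrix-vector product."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def screen(matrix, probe):
    """Mutate every entry to max and to min, and rank mutations by output change."""
    baseline = apply(matrix, probe)
    results = []
    for i, row in enumerate(matrix):
        for j in range(len(row)):
            for mode in ("max", "min"):
                mutant = mutate_element(matrix, i, j, mode)
                output = apply(mutant, probe)
                # Hypothetical effect score: total absolute change in the output.
                effect = sum(abs(a - b) for a, b in zip(output, baseline))
                results.append(((i, j, mode), effect))
    # Largest effects first; these would be the "sensitive" positions.
    return sorted(results, key=lambda item: item[1], reverse=True)


weights = [[0.5, -1.0], [2.0, 0.25]]  # hypothetical 2x2 weight matrix
probe = [1.0, 1.0]                    # hypothetical input
ranked = screen(weights, probe)
for (row, col, mode), effect in ranked:
    print(f"({row}, {col}) -> {mode}: effect {effect}")
```

Ranking the mutations by effect gives a per-position sensitivity map for the toy matrix. In an actual LLM the effect measure would have to compare generated outputs rather than a single vector.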
Taxonomy
Topics: Topic Modeling
