Neuron-Aware Data Selection In Instruction Tuning For Large Language Models

Xin Chen; Junchao Wu; Shu Yang; Runzhe Zhan; Zeyu Wu; Min Yang; Shujian Huang; Lidia S. Chao; Derek F. Wong

arXiv:2603.13201·cs.CL·March 16, 2026

Neuron-Aware Data Selection In Instruction Tuning For Large Language Models

Xin Chen, Junchao Wu, Shu Yang, Runzhe Zhan, Zeyu Wu, Min Yang, Shujian Huang, Lidia S. Chao, Derek F. Wong

PDF

Open Access 3 Reviews

TL;DR

This paper introduces NAIT, a neuron-aware data selection framework for instruction tuning of large language models, which improves performance by selecting high-impact data based on neuron activation similarity.

Contribution

NAIT is a novel, efficient method that evaluates and selects instruction tuning data by analyzing neuron activation patterns to enhance LLM capabilities.

Findings

01

NAIT-selected data outperforms other methods on various tasks.

02

Neuron activation features transfer across different capabilities.

03

Logical reasoning data has strong transferability.

Abstract

Instruction Tuning (IT) has been proven to be an effective approach to unlock the powerful capabilities of large language models (LLMs). Recent studies indicate that excessive IT data can degrade LLMs performance, while carefully selecting a small subset of high-quality IT data can significantly enhance their capabilities. Therefore, identifying the most efficient subset data from the IT dataset to effectively develop either specific or general abilities in LLMs has become a critical challenge. To address this, we propose a novel and efficient framework called NAIT. NAIT evaluates the impact of IT data on LLMs performance by analyzing the similarity of neuron activation patterns between the IT dataset and the target domain capability. Specifically, NAIT captures neuron activation patterns from in-domain datasets of target domain capabilities to construct reusable and transferable neuron…

Peer Reviews

Decision·ICLR 2026 Poster

Reviewer 01Rating 4Confidence 4

Strengths

- While neuron activation analysis has been used to interpret LLMs, its application as a direct signal for instruction data selection is novel and interpretable. - The paper is well written, and the experiments are conducted on a wide range of datasets and base models. - The cost-efficiency analysis is a crucial and compelling strength, showing improvements in speed and cost, which is a highly practical advantage in the LLM era. - The interpretable analysis on transferability and the relation o

Weaknesses

- The framework's first step requires a capabilities-specific in-domain dataset. For the tasks evaluated (MMLU, GSM, etc.), these datasets are readily available from established benchmarks. However, if one wishes to enhance a more abstract or novel capability, curating a high-quality, representative in-domain dataset becomes a significant, and potentially subjective, bottleneck. - This paper should provide a few qualitative examples to illustrate the difference between data points in the "core

Reviewer 02Rating 6Confidence 3

Strengths

- Strong data‑efficiency. NAIT with 10% of Alpaca‑GPT‑4 matches/exceeds full‑data tuning and outperforms several selection baselines. - The neuron‑feature construction and per‑candidate alignment score are conceptually simple and effective across many tasks.

Weaknesses

- The experiments didn’t compare against some strong recent data selection methods like LESS (ICML 2024). - The paper seems doesn’t specify which layers’ activations are used to compute features or why. Since different layers capture different types of information, this choice may matters. explanation or ablations could be added.

Reviewer 03Rating 4Confidence 4

Strengths

1. The idea of using the model’s own internal neuron activation patterns to evaluate data quality is novel and interesting. Unlike prior data selection methods that rely on external scorers or surface heuristics, NAIT directly taps into the model’s internal representation. This provides a more interpretable and fine-grained signal and is externally independent. 2. NAIT effectively addresses the scenario of enhancing specific capabilities in an LLM. The method is flexible – by providing an in-dom

Weaknesses

1. A practical concern is that NAIT requires a representative in-domain dataset for each target capability as a starting point. In real scenarios, such labeled data may not be readily available for every “capability” one wishes to improve. This reliance potentially limits NAIT’s applicability to cases where one can clearly define and obtain data for the target skill. If the in-domain examples are too few or not truly representative, the quality of the activation feature (and thus the selection)

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Multimodal Machine Learning Applications