Is C4 Dataset Optimal for Pruning? An Investigation of Calibration Data for LLM Pruning
Abhinav Bandari, Lu Yin, Cheng-Yu Hsieh, Ajay Kumar Jaiswal, Tianlong, Chen, Li Shen, Ranjay Krishna, Shiwei Liu

TL;DR
This paper investigates the impact of calibration data choice on large language model pruning, revealing that the commonly used C4 dataset is suboptimal and that alternative datasets can improve pruning efficiency and performance.
Contribution
It systematically evaluates various calibration datasets for LLM pruning, challenging the default use of C4 and highlighting the benefits of arithmetic and downstream datasets.
Findings
C4 is not the optimal calibration dataset for LLM pruning
Arithmetic datasets perform as well or better than pre-training datasets
Downstream datasets do not always improve downstream task performance
Abstract
Network pruning has emerged as a potential solution to make LLMs cheaper to deploy. However, existing LLM pruning approaches universally rely on the C4 dataset as the calibration data for calculating pruning scores, leaving its optimality unexplored. In this study, we evaluate the choice of calibration data on LLM pruning, across a wide range of datasets that are most commonly used in LLM training and evaluation, including four pertaining datasets as well as three categories of downstream tasks encompassing nine datasets. Each downstream dataset is prompted with In-Context Learning (ICL) and Chain-of-Thought (CoT), respectively. Besides the already intriguing observation that the choice of calibration data significantly impacts the performance of pruned LLMs, our results also uncover several subtle and often unexpected findings, summarized as follows: (1) C4 is not the optimal choice…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMineral Processing and Grinding · Metallurgical Processes and Thermodynamics · Advanced Surface Polishing Techniques
MethodsPruning
