Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models
Terry Yue Zhuo, Armel Zebaze, Nitchakarn Suppattarachai, Leandro von, Werra, Harm de Vries, Qian Liu, Niklas Muennighoff

TL;DR
Astraios introduces 28 instruction-tuned models across various sizes and methods, revealing that full fine-tuning generally outperforms PEFT, but LoRA offers a good cost-performance balance, with larger models showing reduced robustness and security.
Contribution
The paper provides a comprehensive evaluation of 28 instruction-tuned models using multiple tuning methods across different model sizes, highlighting the trade-offs and effectiveness of PEFT methods like LoRA.
Findings
FFT generally yields the best downstream performance.
LoRA offers the best cost-performance trade-off.
Larger models show reduced robustness and security.
Abstract
The high cost of full-parameter fine-tuning (FFT) of Large Language Models (LLMs) has led to a series of parameter-efficient fine-tuning (PEFT) methods. However, it remains unclear which methods provide the best cost-performance trade-off at different model scales. We introduce Astraios, a suite of 28 instruction-tuned OctoCoder models using 7 tuning methods and 4 model sizes up to 16 billion parameters. Through investigations across 5 tasks and 8 different datasets encompassing both code comprehension and code generation tasks, we find that FFT generally leads to the best downstream performance across all scales, and PEFT methods differ significantly in their efficacy based on the model scale. LoRA usually offers the most favorable trade-off between cost and performance. Further investigation into the effects of these methods on both model robustness and code security reveals that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Topic Modeling · Ferroelectric and Negative Capacitance Devices
