Empirical Studies of Parameter Efficient Methods for Large Language   Models of Code and Knowledge Transfer to R

Amirreza Esmaeili; Iman Saberi; Fatemeh H. Fard

arXiv:2405.01553·cs.SE·January 28, 2025

Empirical Studies of Parameter Efficient Methods for Large Language Models of Code and Knowledge Transfer to R

Amirreza Esmaeili, Iman Saberi, Fatemeh H. Fard

PDF

Open Access 1 Repo

TL;DR

This study evaluates PEFT methods like LoRA, Compacter, and IA^3 for large language models in code tasks, focusing on knowledge transfer to the under-explored language R, and compares their efficiency and effectiveness.

Contribution

It provides the first comprehensive evaluation of PEFT methods on large models for code in low-resource languages like R, highlighting LoRA's superior performance and resource efficiency.

Findings

01

LoRA outperforms other PEFT methods in all settings.

02

Compacter offers significant resource efficiency with minimal performance loss.

03

The number of trainable parameters impacts code accuracy more than PEFT architecture.

Abstract

Parameter Efficient Fine-Tuning (PEFT) methods are proposed as an alternative fine-tuning approach for Large Language Models (LLM) to minimize high training costs. While prior research demonstrates the effectiveness of PEFT methods in knowledge transfer using smaller language models, their application to larger LLMs, particularly in low-resource and unseen programming languages such as R, remains under-explored. In this work, we evaluate PEFT methods, LoRA, Compacter, and IA^3 on LLMs for code summarization and generation, with a particular emphasis on knowledge transfer to R as an unseen under-explored target language. Our experiments reveal that LoRA consistently outperforms Compacter and IA^3 in all settings, while Compacter offers significant resource efficiency with minimal performance trade-offs. Additionally, we find that the number of trainable parameters has a greater influence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Amirresm/empirical_peft_suite
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Linear Layer · Multi-Head Attention · Dense Connections · Gated Linear Unit · Attention Dropout · Adafactor · SentencePiece