PEFT-Bench: A Parameter-Efficient Fine-Tuning Methods Benchmark

Robert Belanec; Branislav Pecher; Ivan Srba; Maria Bielikova

arXiv:2511.21285·cs.CL·May 14, 2026

PEFT-Bench: A Parameter-Efficient Fine-Tuning Methods Benchmark

Robert Belanec, Branislav Pecher, Ivan Srba, Maria Bielikova

PDF

1 Repo 1 Video

TL;DR

PEFT-Bench provides a comprehensive benchmark for evaluating parameter-efficient fine-tuning methods on large language models across multiple datasets, considering various efficiency metrics.

Contribution

It introduces PEFT-Bench, a unified benchmark with the PSCP metric to evaluate diverse PEFT methods on autoregressive LLMs comprehensively.

Findings

01

Demonstrates PEFT-Bench's application on 27 NLP datasets.

02

Introduces the PSCP metric for holistic efficiency evaluation.

03

Evaluates 7 different PEFT methods.

Abstract

Despite the state-of-the-art performance of Large Language Models (LLMs) achieved on many tasks, their massive scale often leads to high computational and environmental costs, limiting their accessibility. Parameter-Efficient Fine-Tuning (PEFT) methods address this challenge by reducing the number of trainable parameters while maintaining strong downstream performance. Despite the advances in PEFT methods, current evaluations remain limited (in terms of evaluated models and datasets) and difficult to reproduce. To bridge this gap, we introduce PEFT-Bench, a unified end-to-end benchmark for evaluating diverse PEFT methods on autoregressive LLMs. We demonstrate its usage across 27 NLP datasets and 7 PEFT methods. To account for different PEFT training and inference factors, we also introduce the PEFT Soft Cost Penalties (PSCP) metric, which takes trainable parameters, inference speed, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kinit-sk/PEFT-Bench
github

Videos

PEFT-Bench: A Parameter-Efficient Fine-Tuning Methods Benchmark· underline