Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on BERTimbau and Exploratory Evaluation of Generative LLMs

Mariela M. Nina; Caio Veloso Costa; Lilian Berton; Didier A. Vega-Oliveros

arXiv:2603.21418·cs.CL·March 24, 2026

Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on BERTimbau and Exploratory Evaluation of Generative LLMs

Mariela M. Nina, Caio Veloso Costa, Lilian Berton, Didier A. Vega-Oliveros

PDF

Open Access

TL;DR

This study systematically evaluates parameter-efficient fine-tuning methods for Portuguese QA using BERTimbau, demonstrating significant efficiency gains and comparable performance to larger models, with an exploratory comparison to generative LLMs.

Contribution

It provides a comprehensive comparison of PEFT techniques on BERTimbau for Portuguese QA and highlights the efficiency advantages over large generative models.

Findings

01

LoRA achieves 95.8% of baseline performance with 73.5% less training time.

02

Higher learning rates significantly improve PEFT performance.

03

Larger models are more resilient to quantization, losing fewer F1 points.

Abstract

Although large language models have transformed natural language processing, their computational costs create accessibility barriers for low-resource languages such as Brazilian Portuguese. This work presents a systematic evaluation of Parameter-Efficient Fine-Tuning (PEFT) and quantization techniques applied to BERTimbau for Question Answering on SQuAD-BR, the Brazilian Portuguese translation of SQuAD v1. We evaluate 40 configurations combining four PEFT methods (LoRA, DoRA, QLoRA, QDoRA) across two model sizes (Base: 110M, Large: 335M parameters). Our findings reveal three critical insights: (1) LoRA achieves 95.8\% of baseline performance on BERTimbau-Large while reducing training time by 73.5\% (F1=81.32 vs 84.86); (2) higher learning rates (2e-4) substantially improve PEFT performance, with F1 gains of up to +19.71 points over standard rates; and (3) larger models show twice the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications