Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on BERTimbau and Exploratory Evaluation of Generative LLMs
Mariela M. Nina, Caio Veloso Costa, Lilian Berton, Didier A. Vega-Oliveros

TL;DR
This study systematically evaluates parameter-efficient fine-tuning methods for Portuguese QA using BERTimbau, demonstrating significant efficiency gains and comparable performance to larger models, with an exploratory comparison to generative LLMs.
Contribution
It provides a comprehensive comparison of PEFT techniques on BERTimbau for Portuguese QA and highlights the efficiency advantages over large generative models.
Findings
LoRA achieves 95.8% of baseline performance with 73.5% less training time.
Higher learning rates significantly improve PEFT performance.
Larger models are more resilient to quantization, losing fewer F1 points.
Abstract
Although large language models have transformed natural language processing, their computational costs create accessibility barriers for low-resource languages such as Brazilian Portuguese. This work presents a systematic evaluation of Parameter-Efficient Fine-Tuning (PEFT) and quantization techniques applied to BERTimbau for Question Answering on SQuAD-BR, the Brazilian Portuguese translation of SQuAD v1. We evaluate 40 configurations combining four PEFT methods (LoRA, DoRA, QLoRA, QDoRA) across two model sizes (Base: 110M, Large: 335M parameters). Our findings reveal three critical insights: (1) LoRA achieves 95.8\% of baseline performance on BERTimbau-Large while reducing training time by 73.5\% (F1=81.32 vs 84.86); (2) higher learning rates (2e-4) substantially improve PEFT performance, with F1 gains of up to +19.71 points over standard rates; and (3) larger models show twice the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications
