Optimizing Retrieval-Augmented Generation (RAG) for Colloquial Cantonese: A LoRA-Based Systematic Review
David Santandreu Calonge (1), Linda Smail (2) ((1) Center for Teaching, Learning, Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, United Arab Emirates, (2) College of Interdisciplinary Studies, Zayed University, Dubai, United Arab Emirates)

TL;DR
This systematic review explores how LoRA-based parameter-efficient fine-tuning enhances Retrieval-Augmented Generation systems for Cantonese, addressing challenges in linguistic authenticity, data scarcity, and computational efficiency.
Contribution
It provides a comprehensive analysis of LoRA integration in RAG systems for Cantonese, highlighting methods to improve efficiency and authenticity in low-resource dialectal settings.
Findings
LoRA reduces trainable parameters significantly.
Dynamic and ensemble LoRA improve efficiency without losing accuracy.
Limitations remain in capturing fine-grained linguistic nuances.
Abstract
This review examines recent advances in Parameter-Efficient Fine-Tuning (PEFT), with a focus on Low-Rank Adaptation (LoRA), to optimize Retrieval-Augmented Generation (RAG) systems like Qwen3, DeepSeek, and Kimi. These systems face challenges in understanding and generating authentic Cantonese colloquial expressions due to limited annotated data and linguistic variability. The review evaluates the integration of LoRA within RAG frameworks, benchmarks PEFT methods for retrieval and generation accuracy, identify domain adaptation strategies under limited data, and compares fine-tuning techniques aimed at improving semantic fidelity under data-scarce conditions. A systematic analysis of recent studies employing diverse LoRA variants, synthetic data generation, user feedback integration, and adaptive parameter allocation was conducted to assess their impact on computational efficiency,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLinguistic Variation and Morphology · Natural Language Processing Techniques · Language and cultural evolution
