Leveraging Lora Fine-Tuning and Knowledge Bases for Construction Identification

Liu Kaipeng; Wu Ling

arXiv:2601.13105·cs.CL·January 21, 2026

Leveraging Lora Fine-Tuning and Knowledge Bases for Construction Identification

Liu Kaipeng, Wu Ling

PDF

Open Access

TL;DR

This paper presents a method combining LoRA fine-tuning and knowledge bases to improve the automatic identification of the English ditransitive construction, demonstrating significant performance gains over baseline models.

Contribution

It introduces a novel approach integrating LoRA fine-tuning with RAG for construction identification, showing improved semantic understanding over pattern matching.

Findings

01

LoRA fine-tuning outperforms baseline models

02

Semantic understanding improves with fine-tuning

03

Model achieves higher accuracy on annotated corpus

Abstract

This study investigates the automatic identification of the English ditransitive construction by integrating LoRA-based fine-tuning of a large language model with a Retrieval-Augmented Generation (RAG) framework.A binary classification task was conducted on annotated data from the British National Corpus. Results demonstrate that a LoRA-fine-tuned Qwen3-8B model significantly outperformed both a native Qwen3-MAX model and a theory-only RAG system. Detailed error analysis reveals that fine-tuning shifts the model's judgment from a surface-form pattern matching towards a more semantically grounded understanding based.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification