Semantic are Beacons: A Semantic Perspective for Unveiling   Parameter-Efficient Fine-Tuning in Knowledge Learning

Renzhi Wang; Piji Li

arXiv:2405.18292·cs.CL·May 29, 2024

Semantic are Beacons: A Semantic Perspective for Unveiling Parameter-Efficient Fine-Tuning in Knowledge Learning

Renzhi Wang, Piji Li

PDF

Open Access 1 Video

TL;DR

This paper investigates why parameter-efficient fine-tuning struggles with factual knowledge learning in large language models, revealing semantic interference issues and proposing strategies to improve knowledge retention.

Contribution

It introduces a semantic perspective to analyze PEFT limitations and proposes data filtering and re-weighted learning strategies to enhance knowledge learning in LLMs.

Findings

01

PEFT can push models away from knowledge targets

02

Multiple knowledge sources interfere, hindering learning

03

Proposed strategies improve knowledge retention in experiments

Abstract

Parameter-Efficient Fine-Tuning (PEFT) methods enable efficient adaptation of Large Language Models (LLMs) to various downstream applications. However, the effectiveness of the PEFT diminishes notably when downstream tasks require accurate learning of factual knowledge. In this paper, we adopt a semantic perspective to investigate this phenomenon, uncovering the reasons behind PEFT's limitations in knowledge learning task. Our findings reveal that: (1) PEFT presents a notable risk of pushing the model away from the intended knowledge target; (2) multiple knowledge interfere with each other, and such interference suppresses the learning and expression of knowledge features. Based on these insights, we introduce a data filtering strategy to exclude data that is detrimental to knowledge learning and a re-weighted learning strategy to make the model attentive to semantic distance during…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Semantic are Beacons: A Semantic Perspective for Unveiling Parameter-Efficient Fine-Tuning in Knowledge Learning· underline

Taxonomy

TopicsNeural Networks and Applications