Position-Aware Parameter Efficient Fine-Tuning Approach for Reducing   Positional Bias in LLMs

Zheng Zhang; Fan Yang; Ziyan Jiang; Zheng Chen; Zhengyang Zhao,; Chengyuan Ma; Liang Zhao; Yang Liu

arXiv:2404.01430·cs.CL·April 3, 2024·2 cites

Position-Aware Parameter Efficient Fine-Tuning Approach for Reducing Positional Bias in LLMs

Zheng Zhang, Fan Yang, Ziyan Jiang, Zheng Chen, Zhengyang Zhao,, Chengyuan Ma, Liang Zhao, Yang Liu

PDF

Open Access

TL;DR

This paper introduces PAPEFT, a novel fine-tuning method that reduces positional bias in large language models, thereby improving their performance on long-context tasks involving external knowledge retrieval.

Contribution

The study presents a new position-aware fine-tuning approach combining data augmentation and adapters to mitigate positional bias in LLMs, which is more effective than prompt-based methods.

Findings

01

PAPEFT reduces positional bias in LLMs.

02

Improves performance on long-context tasks.

03

Enhances knowledge retrieval accuracy.

Abstract

Recent advances in large language models (LLMs) have enhanced their ability to process long input contexts. This development is particularly crucial for tasks that involve retrieving knowledge from an external datastore, which can result in long inputs. However, recent studies show a positional bias in LLMs, demonstrating varying performance depending on the location of useful information within the input sequence. In this study, we conduct extensive experiments to investigate the root causes of positional bias. Our findings indicate that the primary contributor to LLM positional bias stems from the inherent positional preferences of different models. We demonstrate that merely employing prompt-based solutions is inadequate for overcoming the positional preferences. To address this positional bias issue of a pre-trained LLM, we developed a Position-Aware Parameter Efficient Fine-Tuning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEducational Technology and Assessment · Power Systems and Technologies · Natural Language Processing Techniques