Position Debiasing Fine-Tuning for Causal Perception in Long-Term   Dialogue

Shixuan Fan; Wei Wei; Wendi Li; Xian-Ling Mao; Wenfeng Xie; Dangyang; Chen

arXiv:2406.02002·cs.CL·June 5, 2024

Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue

Shixuan Fan, Wei Wei, Wendi Li, Xian-Ling Mao, Wenfeng Xie, Dangyang, Chen

PDF

Open Access

TL;DR

This paper introduces a novel fine-tuning framework called CPD that reduces position bias in long-term dialogue generation by enhancing causal perception, leading to more relevant and context-aware responses.

Contribution

The paper proposes a perturbation-based causal variable discovery method and a local-position awareness technique to improve causal perception during fine-tuning of large language models.

Findings

01

Effective in alleviating position bias in multiple LLMs

02

Significant improvements over baseline models

03

Enhanced relevance and causality in generated responses

Abstract

The core of the dialogue system is to generate relevant, informative, and human-like responses based on extensive dialogue history. Recently, dialogue generation domain has seen mainstream adoption of large language models (LLMs), due to its powerful capability in generating utterances. However, there is a natural deficiency for such models, that is, inherent position bias, which may lead them to pay more attention to the nearby utterances instead of causally relevant ones, resulting in generating irrelevant and generic responses in long-term dialogue. To alleviate such problem, in this paper, we propose a novel method, named Causal Perception long-term Dialogue framework (CPD), which employs perturbation-based causal variable discovery method to extract casually relevant utterances from the dialogue history and enhances model causal perception during fine-tuning. Specifically, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems