Resolving Knowledge Conflicts in Domain-specific Data Selection: A Case Study on Medical Instruction-tuning

Qihuang Zhong; Liang Ding; Fei Liao; Juhua Liu; Bo Du; Dacheng Tao

arXiv:2505.21958·cs.CL·May 29, 2025

Resolving Knowledge Conflicts in Domain-specific Data Selection: A Case Study on Medical Instruction-tuning

Qihuang Zhong, Liang Ding, Fei Liao, Juhua Liu, Bo Du, Dacheng Tao

PDF

Open Access

TL;DR

This paper introduces KDS, a knowledge-aware data selection framework that improves domain-specific instruction-tuning of LLMs by reducing knowledge conflicts, enhancing performance, and mitigating hallucinations, especially in medical applications.

Contribution

The paper proposes a novel KDS framework that leverages knowledge-aware metrics to select high-quality, diverse data for domain-specific instruction-tuning, addressing knowledge conflicts and hallucinations.

Findings

01

KDS outperforms baseline data selection methods in medical instruction-tuning.

02

KDS improves LLMs' domain-specific performance and generalization.

03

KDS reduces hallucination issues in LLM outputs.

Abstract

Domain-specific instruction-tuning has become the defacto standard for improving the performance of large language models (LLMs) in specialized applications, e.g., medical question answering. Since the instruction-tuning dataset might contain redundant or low-quality data, data selection (DS) is usually required to maximize the data efficiency. Despite the successes in the general domain, current DS methods often struggle to select the desired data for domain-specific instruction-tuning. One of the main reasons is that they neglect the impact of knowledge conflicts, i.e., the discrepancy between LLMs' pretrained knowledge and context knowledge of instruction data, which could damage LLMs' prior abilities and lead to hallucination. To this end, we propose a simple-yet-effective Knowledge-aware Data Selection (namely KDS) framework to select the domain-specific instruction-tuning data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications