Knowledgeable Preference Alignment for LLMs in Domain-specific Question   Answering

Yichi Zhang; Zhuo Chen; Yin Fang; Yanxi Lu; Fangming Li; Wen Zhang,; Huajun Chen

arXiv:2311.06503·cs.CL·June 11, 2024·5 cites

Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering

Yichi Zhang, Zhuo Chen, Yin Fang, Yanxi Lu, Fangming Li, Wen Zhang,, Huajun Chen

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces KnowPAT, a novel alignment method that enhances large language models for domain-specific question answering by aligning model preferences with human expectations, improving real-world application performance.

Contribution

The paper proposes a new preference alignment approach, KnowPAT, which constructs preference sets and a unified alignment objective to better tailor LLM responses to human and domain-specific needs.

Findings

01

KnowPAT outperforms 15 baseline methods in domain-specific QA tasks.

02

It effectively aligns LLM preferences with human expectations.

03

Experiments demonstrate improved accuracy and user satisfaction.

Abstract

Deploying large language models (LLMs) to real scenarios for domain-specific question answering (QA) is a key thrust for LLM applications, which poses numerous challenges, especially in ensuring that responses are both accommodating to user requirements and appropriately leveraging domain-specific knowledge bases. They are the two major difficulties for LLM application as vanilla fine-tuning falls short of addressing. Combining these requirements, we conceive of them as the requirement for the model's preference to be harmoniously aligned with humans'. Thus, we introduce Knowledgeable Preference AlignmenT (KnowPAT), which constructs two kinds of preference sets to tackle the two issues. Besides, we design a new alignment objective to align the LLM preference with different human preferences uniformly, aiming to optimize LLM performance in real-world, domain-specific QA settings.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zjukg/knowpat
pytorchOfficial

Videos

Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Graph Neural Networks

MethodsALIGN