RACE-Align: Retrieval-Augmented and Chain-of-Thought Enhanced Preference Alignment for Large Language Models

Qihang Yan; Xinyu Zhang; Luming Guo; Qi Zhang; Feifan Liu

arXiv:2506.02726·cs.CL·June 4, 2025

RACE-Align: Retrieval-Augmented and Chain-of-Thought Enhanced Preference Alignment for Large Language Models

Qihang Yan, Xinyu Zhang, Luming Guo, Qi Zhang, Feifan Liu

PDF

Open Access

TL;DR

RACE-Align is a novel framework that enhances large language models by integrating retrieval-augmented knowledge and explicit chain-of-thought reasoning into preference alignment, significantly improving accuracy and interpretability in specialized domains.

Contribution

It introduces a new preference data construction method combining retrieval and chain-of-thought, and applies DPO for better alignment in domain-specific tasks.

Findings

01

Outperforms baseline models in accuracy and reasoning depth.

02

Improves interpretability and knowledge application in TCM domain.

03

Enhances logicality and information richness of model responses.

Abstract

Large Language Models (LLMs) struggle with accuracy, domain-specific reasoning, and interpretability in vertical domains. Traditional preference alignment methods like Reinforcement Learning from Human Feedback (RLHF) and Direct Preference Optimization (DPO) often overlook the underlying knowledge sources and reasoning logic. This paper introduces RACE-Align (Retrieval-Augmented and Chain-of-Thought Enhanced Alignment), a novel framework designed to address these limitations. RACE-Align systematically constructs a binary preference dataset incorporating external knowledge support and explicit Chain-of-Thought (CoT) reasoning, then aligns LLMs using the DPO algorithm. The core innovation lies in its preference data construction strategy: it integrates AI-driven retrieval for factual grounding, enhancing knowledgeability and accuracy, and emphasizes the optimization of domain-specific…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Text Analysis Techniques · Recommender Systems and Techniques