HIRAG: Hierarchical-Thought Instruction-Tuning Retrieval-Augmented Generation
YiHan Jiao, ZheHao Tan, Dan Yang, DuoLin Sun, Jie Feng, Yue Shen, Jian Wang, Peng Wei

TL;DR
This paper introduces HIRAG, a hierarchical instruction-tuning method for retrieval-augmented generation models that enhances reasoning and information filtering capabilities through multi-level chain-of-thought training.
Contribution
The paper proposes a novel hierarchical instruction-tuning approach, HIRAG, which improves RAG models' reasoning and information filtering by incorporating multi-level chain-of-thought strategies.
Findings
HIRAG significantly improves performance on multiple QA datasets.
The method enhances the model's reasoning and information filtering abilities.
Experiments demonstrate better open-book examination capabilities.
Abstract
Retrieval-augmented generation (RAG) has become a fundamental paradigm for addressing the challenges faced by large language models in handling real-time information and domain-specific problems. Traditional RAG systems primarily rely on the in-context learning (ICL) capabilities of the large language model itself. Still, in-depth research on the specific capabilities needed by the RAG generation model is lacking, leading to challenges with inconsistent document quality and retrieval system imperfections. Even the limited studies that fine-tune RAG generative models often \textit{lack a granular focus on RAG task} or \textit{a deeper utilization of chain-of-thought processes}. To address this, we propose that RAG models should possess three progressively hierarchical abilities (1) Filtering: the ability to select relevant information; (2) Combination: the ability to combine semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Information Retrieval and Search Behavior
