Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large   Language Models

Shayekh Bin Islam; Md Asib Rahman; K S M Tozammel Hossain; Enamul; Hoque; Shafiq Joty; Md Rizwan Parvez

arXiv:2410.01782·cs.CL·October 3, 2024·2 cites

Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models

Shayekh Bin Islam, Md Asib Rahman, K S M Tozammel Hossain, Enamul, Hoque, Shafiq Joty, Md Rizwan Parvez

PDF

Open Access 1 Repo

TL;DR

Open-RAG enhances open-source LLMs' reasoning by transforming them into sparse MoE models, improving accuracy in complex tasks through dynamic expert selection and hybrid retrieval strategies.

Contribution

The paper introduces Open-RAG, a novel framework that significantly improves reasoning in open-source LLMs by transforming them into sparse MoE models with dynamic expert navigation and hybrid retrieval.

Findings

01

Open-RAG outperforms state-of-the-art models like ChatGPT in knowledge-intensive tasks.

02

The hybrid retrieval method balances performance and inference speed effectively.

03

Transforming LLMs into sparse MoE models enhances reasoning capabilities.

Abstract

Retrieval-Augmented Generation (RAG) has been shown to enhance the factual accuracy of Large Language Models (LLMs), but existing methods often suffer from limited reasoning capabilities in effectively using the retrieved evidence, particularly when using open-source LLMs. To mitigate this gap, we introduce a novel framework, Open-RAG, designed to enhance reasoning capabilities in RAG with open-source LLMs. Our framework transforms an arbitrary dense LLM into a parameter-efficient sparse mixture of experts (MoE) model capable of handling complex reasoning tasks, including both single- and multi-hop queries. Open-RAG uniquely trains the model to navigate challenging distractors that appear relevant but are misleading. As a result, Open-RAG leverages latent learning, dynamically selecting relevant experts and integrating external knowledge effectively for more accurate and contextually…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ShayekhBinIslam/openrag
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsAttention Is All You Need · Attention Dropout · WordPiece · Linear Warmup With Linear Decay · Linear Layer · Weight Decay · Byte Pair Encoding · BERT · Softmax · Dropout