Hypercube-Based Retrieval-Augmented Generation for Scientific Question-Answering

Jimeng Shi; Sizhe Zhou; Bowen Jin; Wei Hu; Runchu Tian; Shaowen Wang; Giri Narasimhan; Jiawei Han

arXiv:2505.19288·cs.LG·August 5, 2025

Hypercube-Based Retrieval-Augmented Generation for Scientific Question-Answering

Jimeng Shi, Sizhe Zhou, Bowen Jin, Wei Hu, Runchu Tian, Shaowen Wang, Giri Narasimhan, Jiawei Han

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper introduces Hypercube-RAG, a multi-dimensional retrieval framework that enhances scientific question-answering by leveraging structured semantic information, improving accuracy, efficiency, and explainability over existing methods.

Contribution

The paper proposes a novel hypercube-based indexing and retrieval method for RAG, capturing multi-dimensional semantic information for more precise and efficient scientific QA.

Findings

01

Improves response accuracy by 3.7% over baseline

02

Increases retrieval accuracy by 5.3%

03

Achieves 10-100x faster retrieval speed

Abstract

Large language models (LLMs) often need to incorporate external knowledge to solve theme-specific problems. Retrieval-augmented generation (RAG) has shown its high promise, empowering LLMs to generate more qualified responses with retrieved external data and knowledge. However, most RAG methods retrieve relevant documents based on either sparse or dense retrieval methods or their combinations, which overlooks the essential, multi-dimensional, and structured semantic information present in documents. This structured information plays a critical role in finding concise yet highly relevant information for domain knowledge-intensive tasks, such as scientific question-answering (QA). In this work, we introduce a multi-dimensional (cube) structure, Hypercube, which can index and allocate documents in a pre-defined multi-dimensional space. Built on the hypercube, we further propose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jimengshi/hypercube-rag
noneOfficial

Datasets

Rtian/hypercube-rag
dataset· 145 dl
145 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Advanced Graph Neural Networks

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Attention Dropout · Softmax · WordPiece · Weight Decay · Multi-Head Attention · Layer Normalization · Byte Pair Encoding