MuISQA: Multi-Intent Retrieval-Augmented Generation for Scientific Question Answering
Zhiyuan Li, Haisheng Yu, Guangchuan Guo, Nan Zhou, Jiajun Zhang

TL;DR
This paper introduces MuISQA, a benchmark and retrieval framework for scientific question answering involving multiple intents, improving evidence coverage and retrieval accuracy over traditional single-intent RAG systems.
Contribution
The paper presents MuISQA, a new benchmark for multi-intent scientific QA and an intent-aware retrieval method leveraging LLMs and RRF for better evidence coverage.
Findings
Outperforms conventional RAG methods in evidence coverage
Achieves higher retrieval accuracy on MuISQA and other datasets
Effectively balances diverse intent coverage with reduced redundancy
Abstract
Complex scientific questions often entail multiple intents, such as identifying gene mutations and linking them to related diseases. These tasks require evidence from diverse sources and multi-hop reasoning, while conventional retrieval-augmented generation (RAG) systems are usually single-intent oriented, leading to incomplete evidence coverage. To assess this limitation, we introduce the Multi-Intent Scientific Question Answering (MuISQA) benchmark, which is designed to evaluate RAG systems on heterogeneous evidence coverage across sub-questions. In addition, we propose an intent-aware retrieval framework that leverages large language models (LLMs) to hypothesize potential answers, decompose them into intent-specific queries, and retrieve supporting passages for each underlying intent. The retrieved fragments are then aggregated and re-ranked via Reciprocal Rank Fusion (RRF) to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Advanced Graph Neural Networks
