RAGPPI: RAG Benchmark for Protein-Protein Interactions in Drug Discovery
Youngseung Jeon, Ziwen Li, Thomas Li, JiaSyuan Chang, Morteza Ziyadi, Xiang 'Anthony' Chen

TL;DR
This paper introduces RAGPPI, a new benchmark dataset of over 4,400 question-answer pairs focused on the biological impacts of protein-protein interactions, to support RAG systems in drug discovery.
Contribution
The paper presents the first benchmark dataset for PPI impact identification, including expert-annotated and auto-evaluated QA pairs, advancing RAG applications in drug discovery.
Findings
Created a gold-standard dataset of 500 QA pairs.
Developed an ensemble auto-evaluation LLM for dataset construction.
Established RAGPPI as a resource for the research community.
Abstract
Retrieving the biological impacts of protein-protein interactions (PPIs) is essential for target identification (Target ID) in drug development. Given the vast number of proteins involved, this process remains time-consuming and challenging. Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) frameworks have supported Target ID; however, no benchmark currently exists for identifying the biological impacts of PPIs. To bridge this gap, we introduce the RAG Benchmark for PPIs (RAGPPI), a factual question-answer benchmark of 4,420 question-answer pairs that focus on the potential biological impacts of PPIs. Through interviews with experts, we identified criteria for a benchmark dataset, such as a type of QA and source. We built a gold-standard dataset (500 QA pairs) through expert-driven data annotation. We developed an ensemble auto-evaluation LLM that reflected expert…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Linear Layer · Byte Pair Encoding · Attention Dropout · Softmax · WordPiece · BART · Weight Decay · Multi-Head Attention · Attention Is All You Need
