How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions
Bojana Ba\v{s}aragin, Adela Ljaji\'c, Darija Medvecki, Lorenzo, Cassano, Milo\v{s} Ko\v{s}prdi\'c, Nikola Milo\v{s}evi\'c

TL;DR
This paper presents a retrieval-augmented generation system for biomedical question answering that improves answer reliability by referencing relevant PubMed abstracts, enabling verification and enhancing factual accuracy.
Contribution
It introduces a fine-tuned LLM system that references PubMed abstracts in answers, improving reliability and transparency in biomedical question answering.
Findings
Retrieval system improves PubMed search accuracy by 23%.
Fine-tuned LLM achieves comparable referencing to GPT-4 Turbo.
Publicly available dataset and models for biomedical QA.
Abstract
Large language models (LLMs) have recently become the leading source of answers for users' questions online. Despite their ability to offer eloquent answers, their accuracy and reliability can pose a significant challenge. This is especially true for sensitive domains such as biomedicine, where there is a higher need for factually correct answers. This paper introduces a biomedical retrieval-augmented generation (RAG) system designed to enhance the reliability of generated responses. The system is based on a fine-tuned LLM for the referenced question-answering, where retrieved relevant abstracts from PubMed are passed to LLM's context as input through a prompt. Its output is an answer based on PubMed abstracts, where each statement is referenced accordingly, allowing the users to verify the answer. Our retrieval system achieves an absolute improvement of 23% compared to the PubMed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Text Readability and Simplification
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Layer Normalization · Linear Layer · Label Smoothing · Attention Dropout · Linear Warmup With Linear Decay · Adam · Dropout
