Bailicai: A Domain-Optimized Retrieval-Augmented Generation Framework for Medical Applications
Cui Long, Yongbin Liu, Chunping Ouyang, Ying Yu

TL;DR
Bailicai is a novel framework that enhances medical domain-specific large language models by integrating retrieval-augmented generation, significantly improving accuracy, reducing hallucinations, and outperforming existing models on multiple benchmarks.
Contribution
The paper introduces Bailicai, a domain-optimized RAG framework that improves medical LLM performance and mitigates hallucinations compared to prior methods.
Findings
Outperforms existing medical LLMs on multiple benchmarks.
Effectively reduces hallucinations in medical text generation.
Mitigates noise issues caused by irrelevant documents in RAG.
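As a rough illustration of the noise problem named in the last finding, the sketch below shows one generic way a RAG pipeline can drop irrelevant documents before building a prompt. It is not Bailicai's method: the bag-of-words cosine scoring, the `min_score` threshold, the sample documents, and every function name here are assumptions made for illustration only.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, documents, top_k=2, min_score=0.2):
    """Return up to top_k documents whose similarity to the query is at
    least min_score. The threshold (a hypothetical value) is what filters
    out noisy, irrelevant documents."""
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(d))), d) for d in documents]
    kept = [(s, d) for s, d in scored if s >= min_score]
    kept.sort(key=lambda pair: pair[0], reverse=True)
    return [d for _, d in kept[:top_k]]


def build_prompt(query, passages):
    """Prepend retrieved passages as context; fall back to the bare
    question when nothing passes the relevance filter."""
    if not passages:
        return f"Question: {query}\nAnswer:"
    context = "\n".join(f"- {p}" for p in passages)
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"


# Hypothetical knowledge base with one off-topic (noisy) entry.
DOCS = [
    "Metformin is a first-line medication for type 2 diabetes.",
    "How to request a refund for a cancelled flight.",
    "Insulin therapy is used when oral diabetes medication is insufficient.",
]

if __name__ == "__main__":
    question = "What is the first-line medication for type 2 diabetes?"
    print(build_prompt(question, retrieve(question, DOCS)))
```

The model's parameters are never touched in this setup; only the prompt changes. A production system would typically swap the word-overlap score for a dense retriever or a learned relevance judge, but the filtering step sits in the same place.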
Abstract
Large Language Models (LLMs) have exhibited remarkable proficiency in natural language understanding, prompting extensive exploration of their potential applications across diverse domains. In the medical domain, open-source LLMs have demonstrated moderate efficacy following domain-specific fine-tuning; however, they remain substantially inferior to proprietary models such as GPT-4 and GPT-3.5. These open-source models encounter limitations in the comprehensiveness of domain-specific knowledge and exhibit a propensity for 'hallucinations' during text generation. To mitigate these issues, researchers have implemented the Retrieval-Augmented Generation (RAG) approach, which augments LLMs with background information from external knowledge bases while preserving the model's internal parameters. However, document noise can adversely affect performance, and the application of RAG in the…
Taxonomy
Topics: Recommender Systems and Techniques
Methods: Attention Is All You Need · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Linear Layer · Attention Dropout · Label Smoothing · Residual Connection · Linear Warmup With Linear Decay
