Are Two LLMs Better Than One? A Student-Teacher Dual-Head LLMs Architecture for Pharmaceutical Content Optimization
Suyash Mishra, Qiang Li, Anubhav Girdhar

TL;DR
This paper presents LRBTC, a dual-model architecture combining language and vision models with human oversight to improve quality control of pharmaceutical content, achieving high accuracy and reducing violations.
Contribution
It introduces a novel Student-Teacher dual model framework with waterfall rule filtering for scalable, verifiable content validation in regulated industries.
Findings
Achieves 83.0% F1 and 97.5% recall on AIReg-Bench.
Reduces missed violations by 5x compared to Gemini 2.5 Pro.
Improves mean accuracy by 26.7% on CSpelling.
Abstract
Large language models (LLMs) are increasingly used to create content in regulated domains such as pharmaceuticals, where outputs must be scientifically accurate and legally compliant. Manual quality control (QC) is slow, error prone, and can become a publication bottleneck. We introduce LRBTC, a modular LLM and vision language model (VLM) driven QC architecture covering Language, Regulatory, Brand, Technical, and Content Structure checks. LRBTC combines a Student-Teacher dual model architecture, human in the loop (HITL) workflow with waterfall rule filtering to enable scalable, verifiable content validation and optimization. On AIReg-Bench, our approach achieves 83.0% F1 and 97.5% recall, reducing missed violations by 5x compared with Gemini 2.5 Pro. On CSpelling, it improves mean accuracy by 26.7%. Error analysis further reveals that while current models are strong at detecting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Machine Learning in Healthcare
