Large Language Model's Multi-Capability Alignment in Biomedical Domain
Wentao Wu, Linqing Chen, Hanmeng Zhong, Weilei Wang

TL;DR
This paper introduces BalancedBio, a theoretically grounded framework for efficient biomedical reasoning in AI, ensuring safety, capability integration, and improved real-world deployment performance.
Contribution
It presents a novel framework with new theorems, synthetic data generation methods, and optimization techniques for safe, multi-capability biomedical AI models.
Findings
Achieved state-of-the-art performance in biomedical tasks.
Proved safety and capability preservation bounds.
Reduced deployment costs and improved diagnostic accuracy.
Abstract
BalancedBio is a theoretically grounded framework for parameter-efficient biomedical reasoning, addressing multi-capability integration in domain-specific AI alignment. It establishes the Biomedical Multi-Capability Convergence Theorem, proving orthogonal gradient spaces are essential to prevent capability interference for safe deployment. Key innovations include: (1) Medical Knowledge Grounded Synthetic Generation (MKGSG), extending Source2Synth with clinical workflow constraints and medical ontology validation for factual accuracy and safety; and (2) Capability Aware Group Relative Policy Optimization, deriving optimal hybrid reward weighting to maintain orthogonality in RL, using a reward model with rule-based and model-based scores adapted to biomedical tasks. Mathematical analysis proves Pareto-optimal convergence, preserving performance across capabilities. It achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
