A Human-Centric Pipeline for Aligning Large Language Models with Chinese Medical Ethics
Haoan Jin, Han Ying, Jiacheng Ji, Hanhui Xu, Mengyue Wu

TL;DR
This paper introduces MedES, a Chinese medical ethics benchmark and a guardian-in-the-loop framework to align large language models with complex medical ethical standards, improving their performance in ethical decision-making tasks.
Contribution
It presents a novel, scenario-centric benchmark and a guardian-in-the-loop alignment pipeline tailored for Chinese medical ethics, enhancing LLM ethical compliance.
Findings
Aligned model outperforms larger baselines on ethical tasks
Automated evaluator achieves over 97% accuracy in domain-specific assessments
Framework is adaptable to other legal and cultural environments
Abstract
Recent advances in large language models have enabled their application to a range of healthcare tasks. However, aligning LLMs with the nuanced demands of medical ethics, especially under complex real world scenarios, remains underexplored. In this work, we present MedES, a dynamic, scenario-centric benchmark specifically constructed from 260 authoritative Chinese medical, ethical, and legal sources to reflect the challenges in clinical decision-making. To facilitate model alignment, we introduce a guardian-in-the-loop framework that leverages a dedicated automated evaluator (trained on expert-labeled data and achieving over 97% accuracy within our domain) to generate targeted prompts and provide structured ethical feedback. Using this pipeline, we align a 7B-parameter LLM through supervised fine-tuning and domain-specific preference optimization. Experimental results, conducted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning in Healthcare · Topic Modeling · Artificial Intelligence in Healthcare and Education
