Ethical Risks of Large Language Models in Medical Consultation: An Assessment Based on Reproductive Ethics
Hanhui Xu, Jiacheng Ji, Haoan Jin, Han Ying, Mengyue Wu

TL;DR
This study systematically evaluates large language models' performance in reproductive ethics consultations, revealing significant safety risks, poor source citation, and moral reasoning flaws, thus cautioning against their autonomous use in sensitive medical contexts.
Contribution
The paper provides a comprehensive assessment of LLMs' ethical performance in reproductive health, highlighting critical safety and reasoning deficiencies specific to Chinese regulations.
Findings
29.91% risk of unsafe or misleading advice
Poor performance in citing normative sources
Instances of logical contradictions and moral violations
Abstract
Background: As large language models (LLMs) are increasingly used in healthcare and medical consultation settings, a growing concern is whether these models can respond to medical inquiries in a manner that is ethically compliant--particularly in accordance with local ethical standards. To address the pressing need for comprehensive research on reliability and safety, this study systematically evaluates LLM performance in answering questions related to reproductive ethics, specifically assessing their alignment with Chinese ethical regulations. Methods: We evaluated eight prominent LLMs (e.g., GPT-4, Claude-3.7) on a custom test set of 986 questions (906 subjective, 80 objective) derived from 168 articles within Chinese reproductive ethics regulations. Subjective responses were evaluated using a novel six-dimensional scoring rubric assessing Safety (Normative Compliance, Guidance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Simulation-Based Education in Healthcare · Explainable Artificial Intelligence (XAI)
