Ethical Risks of Large Language Models in Medical Consultation: An Assessment Based on Reproductive Ethics

Hanhui Xu; Jiacheng Ji; Haoan Jin; Han Ying; Mengyue Wu

arXiv:2601.22621·cs.CY·February 2, 2026

Ethical Risks of Large Language Models in Medical Consultation: An Assessment Based on Reproductive Ethics

Hanhui Xu, Jiacheng Ji, Haoan Jin, Han Ying, Mengyue Wu

PDF

Open Access

TL;DR

This study systematically evaluates large language models' performance in reproductive ethics consultations, revealing significant safety risks, poor source citation, and moral reasoning flaws, thus cautioning against their autonomous use in sensitive medical contexts.

Contribution

The paper provides a comprehensive assessment of LLMs' ethical performance in reproductive health, highlighting critical safety and reasoning deficiencies specific to Chinese regulations.

Findings

01

29.91% risk of unsafe or misleading advice

02

Poor performance in citing normative sources

03

Instances of logical contradictions and moral violations

Abstract

Background: As large language models (LLMs) are increasingly used in healthcare and medical consultation settings, a growing concern is whether these models can respond to medical inquiries in a manner that is ethically compliant--particularly in accordance with local ethical standards. To address the pressing need for comprehensive research on reliability and safety, this study systematically evaluates LLM performance in answering questions related to reproductive ethics, specifically assessing their alignment with Chinese ethical regulations. Methods: We evaluated eight prominent LLMs (e.g., GPT-4, Claude-3.7) on a custom test set of 986 questions (906 subjective, 80 objective) derived from 168 articles within Chinese reproductive ethics regulations. Subjective responses were evaluated using a novel six-dimensional scoring rubric assessing Safety (Normative Compliance, Guidance…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Simulation-Based Education in Healthcare · Explainable Artificial Intelligence (XAI)