Evaluating ChatGPT-5’s Performance in Answering Common Patient Questions About Femoroacetabular Impingement and Hip Arthroscopy

Maximilian Voss; Hannah Jaeger; Mikhail Salzmann; Robert Prill; Timoty Osterberger; Ingo J. Banke; Nikolai Ramadanov

PMC · DOI:10.1007/s43465-026-01696-3·February 2, 2026

Evaluating ChatGPT-5’s Performance in Answering Common Patient Questions About Femoroacetabular Impingement and Hip Arthroscopy

Maximilian Voss, Hannah Jaeger, Mikhail Salzmann, Robert Prill, Timoty Osterberger, Ingo J. Banke, Nikolai Ramadanov

PDF

Open Access

TL;DR

This study shows that ChatGPT-5 provides accurate and clear answers to patient questions about hip conditions and surgery, making it a useful educational tool in orthopedics.

Contribution

The study evaluates ChatGPT-5's performance on FAIS and HAS, showing improvements over earlier models in accuracy and completeness.

Findings

01

ChatGPT-5 received high scores for accuracy, clarity, and relevance in answering patient questions about FAIS and HAS.

02

Inter-rater reliability was moderate to excellent, with high agreement between two orthopedic surgeons.

03

Responses were free of factual errors, but some were brief, slightly affecting completeness.

Abstract

Hip arthroscopy (HAS) is widely used to treat femoroacetabular impingement syndrome (FAIS), and many patients rely on online resources for medical information. Large language models (LLMs) such as ChatGPT have shown potential as supplementary educational tools in orthopedics; however, existing evaluations are limited to earlier model generations with variable accuracy and completeness. This study aimed to evaluate the accuracy, clarity, relevance, and completeness of ChatGPT-5 responses to common patient questions regarding FAIS and HAS. ChatGPT-5 was used to generate 25 frequently asked patient questions and corresponding answers related to hip preservation. Two fellowship-trained hip preservation surgeons independently evaluated each response using a five-point Likert scale across four predefined domains: relevance, accuracy, clarity, and completeness. Descriptive statistics were…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Diseases1

FAIS

Figures1

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Hip disorders and treatments · Topic Modeling