Evaluating ChatGPT-5’s Performance in Answering Common Patient Questions About Femoroacetabular Impingement and Hip Arthroscopy
Maximilian Voss, Hannah Jaeger, Mikhail Salzmann, Robert Prill, Timoty Osterberger, Ingo J. Banke, Nikolai Ramadanov

TL;DR
This study finds that ChatGPT-5 gives accurate, clear answers to common patient questions about femoroacetabular impingement syndrome (FAIS) and hip arthroscopy (HAS), supporting its use as a supplementary educational tool in orthopedics.
Contribution
The study evaluates ChatGPT-5's answers to common patient questions about FAIS and HAS and reports improvements over earlier model generations in accuracy and completeness.
Findings
ChatGPT-5 received high scores for accuracy, clarity, and relevance in answering patient questions about FAIS and HAS.
Inter-rater reliability was moderate to excellent, with high agreement between the two orthopedic surgeons who rated the responses.
Responses were free of factual errors, but some were brief, which slightly lowered completeness scores.
Abstract
Hip arthroscopy (HAS) is widely used to treat femoroacetabular impingement syndrome (FAIS), and many patients rely on online resources for medical information. Large language models (LLMs) such as ChatGPT have shown potential as supplementary educational tools in orthopedics; however, existing evaluations are limited to earlier model generations with variable accuracy and completeness. This study aimed to evaluate the accuracy, clarity, relevance, and completeness of ChatGPT-5 responses to common patient questions regarding FAIS and HAS. ChatGPT-5 was used to generate 25 frequently asked patient questions and corresponding answers related to hip preservation. Two fellowship-trained hip preservation surgeons independently evaluated each response using a five-point Likert scale across four predefined domains: relevance, accuracy, clarity, and completeness. Descriptive statistics were…
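The rating design described in the abstract (two raters, a five-point Likert scale, four domains) can be illustrated with a minimal Python sketch. The excerpt does not state which reliability statistic the authors used, so the linearly weighted Cohen's kappa below is an assumption chosen for illustration, and the function names `summarize` and `weighted_kappa` as well as all scores are hypothetical placeholders, not study data.

```python
from statistics import mean, stdev


def summarize(scores):
    """Return (mean, sample standard deviation) for a list of Likert scores."""
    return mean(scores), stdev(scores)


def weighted_kappa(rater_a, rater_b, categories=(1, 2, 3, 4, 5)):
    """Linearly weighted Cohen's kappa for two raters on an ordinal scale.

    Assumption: this statistic is used here for illustration only; the
    excerpt does not say which reliability measure the study reports.
    """
    k = len(categories)
    index = {c: i for i, c in enumerate(categories)}
    n = len(rater_a)

    # Observed joint proportions of (rater A score, rater B score).
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[index[a]][index[b]] += 1.0 / n

    # Marginal proportions for each rater.
    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    # Linear disagreement weights: 0 on the diagonal, 1 at maximum distance.
    observed_disagreement = 0.0
    expected_disagreement = 0.0
    for i in range(k):
        for j in range(k):
            weight = abs(i - j) / (k - 1)
            observed_disagreement += weight * observed[i][j]
            expected_disagreement += weight * row[i] * col[j]

    if expected_disagreement == 0:
        # Both raters used one identical category throughout; kappa is undefined.
        return float("nan")
    return 1.0 - observed_disagreement / expected_disagreement


# Hypothetical placeholder scores for one domain (NOT data from the study).
rater_a = [5, 4, 5, 5, 3, 4]
rater_b = [5, 4, 4, 5, 3, 5]

avg, sd = summarize(rater_a + rater_b)
print(f"mean={avg:.2f}, sd={sd:.2f}")
print(f"weighted kappa={weighted_kappa(rater_a, rater_b):.2f}")
```

In practice the same two steps would be repeated per domain (relevance, accuracy, clarity, completeness). An intraclass correlation coefficient is a common alternative for this kind of design and may be closer to what the authors computed.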
Taxonomy
Topics: Artificial Intelligence in Healthcare and Education · Hip disorders and treatments · Topic Modeling
