Accuracy and Readability of Artificial Intelligence Chatbot Responses to Vasectomy-Related Questions: Public Beware
Jonathan A Carlson, Robin Z Cheng, Alyssa Lange, Nadiminty Nagalakshmi, John Rabets, Tariq Shah, Puneet Sindhwani

TL;DR
This study evaluates how accurate and easy to understand AI chatbots are when answering questions about vasectomies, finding that they are mostly accurate but vary in readability.
Contribution
The study is one of the first to assess the accuracy and readability of AI chatbots for urology-related medical information.
Findings
ChatGPT provided the most accurate responses with an average score of 5.04 on the Likert scale.
Google Bard had the highest readability scores, while ChatGPT had the lowest readability.
All five chatbots scored at least 'somewhat accurate' on average, but readability varied significantly.
Abstract
Purpose Artificial intelligence (AI) has rapidly gained popularity with the growth of ChatGPT (OpenAI, San Francisco, USA) and other large-language model chatbots, and these programs have tremendous potential to impact medicine. One important area of consequence in medicine and public health is that patients may use these programs in search of answers to medical questions. Despite the increased utilization of AI chatbots by the public, there is little research to assess the reliability of ChatGPT and alternative programs when queried for medical information. This study seeks to elucidate the accuracy and readability of AI chatbots in answering patient questions regarding urology. As vasectomy is one of the most common urologic procedures, this study investigates AI-generated responses to frequently asked vasectomy-related questions. For this study, five popular and free-to-access AI…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Machine Learning in Materials Science
