Preparing for Vascular Surgery Board Certification: A Comparative Study Using Large Language Models
Sonal Kumar, George Y Tadros, Taylor E Collignon, Otto Montero, Sophia Bampoh, Morris Sasson, Alberto Lopez

TL;DR
This study compares how well different AI tools help prepare for vascular surgery board exams, finding that Claude 3.5 performs best.
Contribution
The study evaluates and compares the effectiveness of large language models in vascular surgery board exam preparation.
Findings
Claude 3.5 achieved the highest accuracy (65.7%) in answering vascular surgery questions.
Claude 3.5 showed significant performance differences across disciplines like lower extremity and cerebrovascular conditions.
Current LLMs do not fully meet the evolving needs of vascular surgery education.
Abstract
Introduction and aim Large language models (LLMs) are transforming medical education by offering innovative methods to enhance teaching and learning. Despite their demonstrated potential, research on its use in vascular surgery is limited. This study aimed to evaluate and compare the effectiveness of LLM in preparing for vascular surgery board certification exams, exploring their potential as educational supplements. Methods We selected 269 text-only multiple-choice questions of 642 from the Vascular Education and Self-Assessment Program (VESAP) version 6. We excluded 143 image-based questions. One independent reviewer input questions into the following four AI tools: ChatGPT 3.5 (San Francisco, CA: OpenAI), Google Gemini (London, UK: Google DeepMind), Microsoft Bing (Redmond, WA: Microsoft), and Claude 3.5 (San Francisco, CA: Anthropic Inc.). Each question with answer choices was…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Radiomics and Machine Learning in Medical Imaging · Cardiac, Anesthesia and Surgical Outcomes
