Preparing for Vascular Surgery Board Certification: A Comparative Study Using Large Language Models

Sonal Kumar; George Y Tadros; Taylor E Collignon; Otto Montero; Sophia Bampoh; Morris Sasson; Alberto Lopez

PMC · DOI:10.7759/cureus.83848·May 10, 2025

Preparing for Vascular Surgery Board Certification: A Comparative Study Using Large Language Models

Sonal Kumar, George Y Tadros, Taylor E Collignon, Otto Montero, Sophia Bampoh, Morris Sasson, Alberto Lopez

PDF

Open Access

TL;DR

This study compares how well different AI tools help prepare for vascular surgery board exams, finding that Claude 3.5 performs best.

Contribution

The study evaluates and compares the effectiveness of large language models in vascular surgery board exam preparation.

Findings

01

Claude 3.5 achieved the highest accuracy (65.7%) in answering vascular surgery questions.

02

Claude 3.5 showed significant performance differences across disciplines like lower extremity and cerebrovascular conditions.

03

Current LLMs do not fully meet the evolving needs of vascular surgery education.

Abstract

Introduction and aim Large language models (LLMs) are transforming medical education by offering innovative methods to enhance teaching and learning. Despite their demonstrated potential, research on its use in vascular surgery is limited. This study aimed to evaluate and compare the effectiveness of LLM in preparing for vascular surgery board certification exams, exploring their potential as educational supplements. Methods We selected 269 text-only multiple-choice questions of 642 from the Vascular Education and Self-Assessment Program (VESAP) version 6. We excluded 143 image-based questions. One independent reviewer input questions into the following four AI tools: ChatGPT 3.5 (San Francisco, CA: OpenAI), Google Gemini (London, UK: Google DeepMind), Microsoft Bing (Redmond, WA: Microsoft), and Claude 3.5 (San Francisco, CA: Anthropic Inc.). Each question with answer choices was…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Diseases6

conditions LLMs disorders of peripheral blood vessels VSQE aortoiliac disease cerebrovascular disease

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Radiomics and Machine Learning in Medical Imaging · Cardiac, Anesthesia and Surgical Outcomes