GPT-4 passes most of the 297 written Polish Board Certification Examinations
Jakub Pokrywka, Jeremi Kaczmarek, Edward Gorzela\'nczyk

TL;DR
This study evaluates GPT-4's ability to pass Polish medical certification exams, showing significant progress over GPT-3.5 and highlighting potential applications of AI in healthcare, with GPT-4 passing 75% of the tests.
Contribution
First comprehensive assessment of GPT models on Polish medical certification exams across multiple specialties, demonstrating GPT-4's substantial passing rate and potential for medical AI applications.
Findings
GPT-3.5 did not pass any exams.
GPT-4 passed 75% of exams, with 222 out of 297.
Performance varied across specialties, excelling in some and failing in others.
Abstract
Introduction: Recently, the effectiveness of Large Language Models (LLMs) has increased rapidly, allowing them to be used in a great number of applications. However, the risks posed by the generation of false information through LLMs significantly limit their applications in sensitive areas such as healthcare, highlighting the necessity for rigorous validations to determine their utility and reliability. To date, no study has extensively compared the performance of LLMs on Polish medical examinations across a broad spectrum of specialties on a very large dataset. Objectives: This study evaluated the performance of three Generative Pretrained Transformer (GPT) models on the Polish Board Certification Exam (Pa\'nstwowy Egzamin Specjalizacyjny, PES) dataset, which consists of 297 tests. Methods: We developed a software program to download and process PES exams and tested the performance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCardiac, Anesthesia and Surgical Outcomes
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Dropout · Label Smoothing · Residual Connection · Softmax · Position-Wise Feed-Forward Layer · Byte Pair Encoding
