# ChatGPT Answers the 110-Question Laboratory Enzymology Student Exam: Pass or Fail?

**Authors:** Berina Hasanefendic, Aleksandra Pasic, Selvedina Duskan, Emir Sehercehajic, Altaira Jazic Durmisevic

PMC · DOI: 10.7759/cureus.82168 · Cureus · 2025-04-13

## TL;DR

This study tested ChatGPT's knowledge on a lab enzymology exam and found it scored lower than students, suggesting it could help in education but not replace humans.

## Contribution

The study compares ChatGPT versions and students on a lab enzymology exam, revealing AI's strengths and weaknesses in this domain.

## Key findings

- ChatGPT-4.0 scored higher than ChatGPT-3.5 but still lower than students.
- Chatbots performed best on metabolism-related questions and worst on enzyme lab analysis.
- ChatGPT scored below the passing threshold of 60%.

## Abstract

Introduction

Chatbots like ChatGPT have attracted a lot of interest lately due to their ability to generate human-like responses. Their reliability and accuracy are still questionable, and they are the topic of many studies in different fields. Therefore, the aim of this study was to examine the knowledge of two versions of chatbots regarding laboratory enzymology and to compare it with the average knowledge of students for the purpose of considering the use of ChatGPT in providing answers in this field.

Material and methods

An exam with 110 questions covering four topics was answered by students and ChatGPT-3.5 and ChatGPT-4.0. The accuracy of the answers of 52 students and ChatGPT was evaluated. The accuracy of answers between students and artificial intelligence was compared, and the percentage of passing the exam was 60%. All responses were reviewed by two authors with full interrater agreement.

Results

Total scores for students, ChatGPT-3.5, and ChatGPT-4.0 were 85.46%, 52.73%, and 74.55% (p < 0.05), whereby ChatGPT-4.0 achieved better results compared to the other chatbot. ChatGPT-3.5 and ChatGPT-4.0 achieved the best results on questions about enzymes in metabolism. The lowest scores for both chatbots were observed in the laboratory analysis of enzymes.

Conclusion

ChatGPT showed average results in the Laboratory Enzymology exam and scored lower than students. This proved that chatbots could be a potential tool for learning and eventual implementation in higher and/or medical education with extensive optimization but still cannot replace a human.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12070819/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12070819/full.md

## References

16 references — full list in the complete paper: https://tomesphere.com/paper/PMC12070819/full.md

---
Source: https://tomesphere.com/paper/PMC12070819