BacPrep: Lessons from Deploying an LLM-Based Bacalaureat Assessment Platform
Adrian-Marius Dumitran, Radu Dita, Angela Liliana Dumitran

TL;DR
BacPrep is an online platform utilizing LLMs to automate assessment of Romanian Bacalaureat exam solutions, revealing challenges and proposing architectural improvements for reliable grading.
Contribution
This work introduces BacPrep, an LLM-based assessment platform for exams, highlighting key challenges and proposing a novel architecture for improved grading consistency.
Findings
LLM grading shows inconsistency across runs
Arithmetic errors affect score aggregation
Performance degrades with large prompts
Abstract
Accessing quality preparation and feedback for the Romanian Bacalaureat exam is challenging, particularly for students in remote or underserved areas. This paper presents BacPrep, an experimental online platform exploring Large Language Model (LLM) potential for automated assessment, aiming to offer a free, accessible resource. Using official exam questions from the last 5 years, BacPrep employs the latest available Gemini Flash model (currently Gemini 2.5 Flash, via the \texttt{gemini-flash-latest} endpoint) to prioritize user experience quality during the data collection phase, with model versioning to be locked for subsequent rigorous evaluation. The platform has collected over 100 student solutions across Computer Science and Romanian Language exams, enabling preliminary assessment of LLM grading quality. This revealed several significant challenges: grading inconsistency across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
