Scalable and Personalized Oral Assessments Using Voice AI
Panos Ipeirotis, Konstantinos Rizakos

TL;DR
This paper presents Viva, an AI-driven system for scalable, personalized oral assessments that reduce grading costs and improve exam authenticity using voice AI and multi-LLM evaluation.
Contribution
The authors introduce Viva, a novel voice AI system for conducting and grading personalized oral exams efficiently and cost-effectively, with insights on system design and failure analysis.
Findings
Grading cost remained under one dollar per exam for undergraduate cohorts.
Decomposing tasks into modules improves reliability and flexibility.
Randomization and voice quality significantly impact exam authenticity.
Abstract
Students in our AI/ML course submitted polished, well-argued project analyses. Then, in class discussion, we asked them to walk through a single choice from their own work. Many could not. The writing looked great. The understanding often wasn't. Oral examinations retain an evidentiary link where written work no longer does: a student who can reason aloud, defend a decision under follow-up, and adapt when pushed demonstrates something no submitted document can certify. The obstacle has always been cost. A 25-minute oral reviewed by two graders takes roughly 30 combined instructor and TA hours for 36 students; at 100 the format is untenable. Voice AI and automated grading change the arithmetic. We built Viva, a system that conducts a personalized oral exam, then grades the transcript with a panel of three LLMs that score independently, read each other's assessments, and revise. Across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · AI in Service Interactions · Explainable Artificial Intelligence (XAI)
