Facilitating Holistic Evaluations with LLMs: Insights from Scenario-Based Experiments
Toru Ishida, Tongxi Liu, Hailong Wang, William K. Cheunga

TL;DR
This paper investigates using a Large Language Model to facilitate holistic evaluations in workshop courses by integrating diverse faculty assessments and explaining pedagogical theories, aiming to improve deliberation and scoring processes.
Contribution
It introduces a novel application of LLMs as facilitators for faculty evaluations, demonstrating their ability to synthesize diverse assessments and generate evaluation criteria.
Findings
LLMs effectively facilitate faculty discussions.
LLMs can generate evaluation criteria from single scenarios.
LLMs explain pedagogical theories to faculty.
Abstract
Workshop courses designed to foster creativity are gaining popularity. However, even experienced faculty teams find it challenging to realize a holistic evaluation that accommodates diverse perspectives. Adequate deliberation is essential to integrate varied assessments, but faculty often lack the time for such exchanges. Deriving an average score without discussion undermines the purpose of a holistic evaluation. Therefore, this paper explores the use of a Large Language Model (LLM) as a facilitator to integrate diverse faculty assessments. Scenario-based experiments were conducted to determine if the LLM could integrate diverse evaluations and explain the underlying pedagogical theories to faculty. The results were noteworthy, showing that the LLM can effectively facilitate faculty discussions. Additionally, the LLM demonstrated the capability to create evaluation criteria by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law
