Explainable Collaborative Problem Solving Diagnosis with BERT using SHAP and its Implications for Teacher Adoption
Kester Wong, Sahan Bulathwela, Mutlu Cukurova

TL;DR
This paper explores the explainability of BERT-based models for diagnosing collaborative problem solving in education using SHAP, highlighting the importance of transparency for teacher trust and the limitations of current explanations.
Contribution
It applies SHAP to analyze token contributions in BERT CPS classification, revealing issues with explanation quality and suggesting directions for improving model transparency and human-AI collaboration.
Findings
Well-performing models may lack reasonable explanations.
Certain tokens disproportionately influence classifications.
Spurious words can positively affect model decisions.
Abstract
The use of Bidirectional Encoder Representations from Transformers (BERT) model and its variants for classifying collaborative problem solving (CPS) has been extensively explored within the AI in Education community. However, limited attention has been given to understanding how individual tokenised words in the dataset contribute to the model's classification decisions. Enhancing the explainability of BERT-based CPS diagnostics is essential to better inform end users such as teachers, thereby fostering greater trust and facilitating wider adoption in education. This study undertook a preliminary step towards model transparency and explainability by using SHapley Additive exPlanations (SHAP) to examine how different tokenised words in transcription data contributed to a BERT model's classification of CPS processes. The findings suggested that well-performing classifications did not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
