Exploring Human-AI Complementarity in CPS Diagnosis Using Unimodal and Multimodal BERT Models

Kester Wong; Sahan Bulathwela; Mutlu Cukurova

arXiv:2507.14579·cs.CL·July 22, 2025

Exploring Human-AI Complementarity in CPS Diagnosis Using Unimodal and Multimodal BERT Models

Kester Wong, Sahan Bulathwela, Mutlu Cukurova

PDF

TL;DR

This paper investigates the use of unimodal and multimodal BERT models, particularly AudiBERT, for detecting collaborative problem solving indicators in dialogue, emphasizing statistical significance and human-AI complementarity.

Contribution

It extends prior work by demonstrating statistically significant improvements of AudiBERT in social-cognitive CPS classification and discusses strategies for leveraging human-AI complementarity with explainability.

Findings

01

AudiBERT improved classification of sparse classes in CPS detection.

02

Significant class-wise improvements were observed in social-cognitive dimensions.

03

Larger training data correlated with higher recall for both models.

Abstract

Detecting collaborative problem solving (CPS) indicators from dialogue using machine learning techniques is a significant challenge for the field of AI in Education. Recent studies have explored the use of Bidirectional Encoder Representations from Transformers (BERT) models on transcription data to reliably detect meaningful CPS indicators. A notable advancement involved the multimodal BERT variant, AudiBERT, which integrates speech and acoustic-prosodic audio features to enhance CPS diagnosis. Although initial results demonstrated multimodal improvements, the statistical significance of these enhancements remained unclear, and there was insufficient guidance on leveraging human-AI complementarity for CPS diagnosis tasks. This workshop paper extends the previous research by highlighting that the AudiBERT model not only improved the classification of classes that were sparse in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.